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Abstract 

We consider simple random walk on a critical Galton- Watson tree conditioned to have 

height greater than n. It is well known that the cardinality of the set of vertices of the tree at 

generation n is then of order n. We prove the existence of a constant /3 w 0.78 such that the 

£\j hitting distribution of the generation n in the tree by simple random walk is concentrated 

with high probability on a set of cardinality approximately equal to r\r . In terms of the 

<**> I analogous continuous model, the dimension of harmonic measure of a level set of the tree is 

-^ equal to /?, whereas the dimension of any level set is equal to 1. The constant j3 is expressed 

in terms of the asymptotic distribution of the conductance of large critical Galton- Watson 

trees. 
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P^ 1 Introduction 

Oh 

From a probabilistic point of view, the harmonic measure on a set is the hitting distribution of 
that set by random walk, in the discrete setting, or by Brownian motion, in the continuous 

S setting. Harmonic measure has been studied in depth both in harmonic analysis and in 

probability theory, and it would be hopeless to try to survey the literature on this subject. 
It has been observed in different contexts that harmonic measure on a fractal set is often 
-H supported on a subset of strictly smaller dimension. For example the famous Makarov 

J> theorem [5T] states that harmonic measure on the boundary of a simply connected planar 

domain is always supported on a subset of Hausdorff dimension equal to 1 , regardless of the 
dimension of the boundary (see [2] for similar results in a discrete setting and pr for higher 
dimensional analogs). 

This "dimension drop" phenomenon also appears in the context of discrete random trees. 
In |18j . Lyons, Pcmantlc and Peres studied the harmonic measure at infinity for simple ran- 
dom walk on a supercritical Galton-Watson tree and proved that the harmonic measure is 
supported on a boundary set of dimension strictly less that the dimension of the whole bound- 
ary. The same authors then extended this result to biased random walk on a supercritical 
Galton-Watson tree [H] . 

This article is concerned with the asymptotic study of harmonic measure on generation n 
of a critical Galton-Watson tree, whose offspring has finite variance and which is conditioned 
to have height greater than n. In this setting, we prove again that the harmonic measure is 
supported on a "small" subset of the boundary. More specifically, we establish the existence 
of a constant j3 ~ 0.78, which does not depend on the offspring distribution, such that most 
of the harmonic measure on generation n of the tree is concentrated on a set of approximately 
•nP vertices, with high probability. This should be contrasted with the fact that the generation 
n of the tree has about n vertices. The constant /? has an explicit expression in terms of the 
law of a random variable C, which is the limit in distribution of the (scaled) conductance of 
the tree between the root and generation n - again this limiting distribution does not depend 
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on the offspring distribution, fn the related continuous model, we show that the Hausdorff 
dimension of the harmonic measure is almost surely equal to /3, whereas the dimension of 
the boundary is known to be equal to 1. Let us describe our results in a more precise way. 

Discrete setting. Let 9 be a probability measure on Z + , and assume that 9 has mean 
one and finite variance a 2 > 0. Under the probability P, for every integer n > 0, we let 
T^"' be a Galton-Watson tree with offspring distribution 9, conditioned on non-extinction 
at generation n. Conditionally on the tree T'"', we then consider simple random walk on 
T(™', starting from the root, and we let E n be the first hitting point of generation n by 
random walk. Our object of study is the "harmonic measure" /i„, which is the law of £„. 
Notice that fi n is a random probability measure supported on the set T„ of all vertices of 
"P") at generation n. By a classical theorem of the theory of branching processes, rT 1 ffJ n 
converges in distribution to an exponential distribution with parameter 2/cr 2 . 

Theorem 1. There exists a constant ft G (0, 1), which does not depend on the offspring 
distribution 9, such that, for every 5 > 0, we have the convergence in ¥ -probability 



fi n ({v G T<"> : n-P- 5 < fi n (v) < n -^+ 5 }) 



1. 



>oo 



Consequently, for every e G (0,1), there exists, with P-probability tending to I as n —} oo, a 

subset A n<e of Tn such that #A n ,E < n" + and ii n (A n ^ e ) > 1 — e. Conversely, the maximal 
fi n -measure of a set of cardinality bounded by n^ tends to as n — > oo, in ¥ -probability. 

Although we have no exact numerical expression for /3, calculations using the formulas 
in Proposition [3] below indicate that rs 0.78. See the discussion at the end of Section |3~f] 
This approximate numerical value confirms simulations made in physics |12j . 

The last two assertions of the theorem are easy consequences of the first one. Indeed, 
A„ )E := {v G T„ : fi n (v) > rT^^ s } has cardinality smaller than n l3+s , and the first 
assertion of the theorem shows that the /i„-measure of the latter set is greater than 1 — e 
with P-probability tending to 1 as n ~ > oo. On the other hand, if A is any subset of T„ 
with cardinality smaller than n@~ , we have 

Hn{£) < n n ({v G TW : n n (v) > n-P+ s l 2 }) + n^n-^' 2 

and the first term in the right-hand side tends to in P-probability by the first assertion of 
the theorem. 

Continuous setting. A key ingredient of the proof of Theorem fT] is a similar result in 
the continuous setting. A critical Galton-Watson tree conditioned on having height greater 
than n, viewed as a metric space for the graph distance normalized by the factor n^ 1 , is 
close in the Gromov-Hausdorff sense to a variant of Aldous' Brownian continuum random 
tree |2], also called the CRT. So a continuous analog of the harmonic measure /j, n would 
be the hitting distribution of height 1 by Brownian motion on the CRT starting from the 
root. Although the construction of Brownian motion on the CRT has been carried out in 
[T3] (see also [5] for a simpler construction), we will not follow this approach, because there 
is a simpler way of looking at the continuous setting. 

The point is that properties of the harmonic measure [i n on T„ can be read from the 
reduced tree T*" that consists only of vertices of T*-"^ that have descendants at generation 
n. In other words, we can chop off the branches of the discrete tree that do not reach the 
level n. Indeed, a simple argument shows that the hitting distribution of generation n is the 
same for simple random walk on T^") and on the reduced tree T* n . 

The scaling limit of the discrete reduced trees T*™ (when distances are scaled by the 
factor n~ l ) is particularly simple. We define a random compact M-tree by the following 




T i:: 1 '. l .v. -i ' ' "" 



Figure 1: A large (binary) Galton-Watson tree and the reduced tree at a given level 



device. We start from an (oriented) line segment whose length [7 is uniformly distributed 
over [0, 1] and whose origin will serve as the root of our tree. At the other end of this initial 
line segment, we attach the initial point of two other line segments with respective lengths 
U\ and Ui such that, conditionally given U , U\ and U% are independent and uniformly 
distributed over [0,1 — £/#]. At the other end of the first of these segments, respectively 
of the second one, we attach two line segments whose lengths are again independent and 
uniformly distributed over [0, 1 — U& ~ U%], resp. over [0, 1 — Uz — U2], conditionally on the 
triplet (Uz, U\, U%)- We continue the construction by induction and after an infinite number 
of steps we get a random (non-compact) rooted ]R-tree, whose completion is denoted by A. 
This is the scaling limit of the discrete reduced trees T* n . See Section [2T| for a more precise 
construction. 

The metric on A is denoted by d. By definition, the boundary <9A consists of all points 
of A at height 1, that is, at distance 1 from the root: These are exactly the points that are 
added when taking the completion in the preceding construction. 

It is then easy to define Brownian motion on A starting from the root and up to the first 
hitting time of dA (it would be possible to extend the definition of Brownian motion beyond 
the first hitting time of dA, but this is not relevant to our purposes). Roughly speaking, this 
process behaves like linear Brownian motion as long as it stays on an "open interval" of the 
tree. It is reflected at the root of the tree and when it arrives at a branching point, it chooses 
each of the three possible line segments incident to this point with equal probabilities. The 
harmonic measure /1 is then the (quenched) distribution of the first hitting point of dA by 
Brownian motion (see Section pTT] for details). 

Theorem 2. With the same constant /3 as in TheoremYn we have P a.s. fi(dx) a.e., 

\o%n(m d (x,r)) 



lim 

rj.0 



P, 



where 3@&(x, t) stands for the closed ball of radius r centered at x in the metric space (A, d). 
Consequently, the Hausdorff dimension of 11 is P a.s. equal to /?. 

The fact that the second assertion of the theorem follows from the first one is standard. 
See e.g. Lemma 4.1 in [T5] . 

Remark. It is not hard to prove that the Hausdorff dimension of dA (with respect to d) 
is a.s. equal to 1. An exact Hausdorff measure function is even given by Theorem 1.3 in 
Duquesne and Le Gall [7j. 

Let us give some ideas of the proof of Theorem [21 It is well known that one can turn the 
tree A, or rather the subtree A\<9A, into a "stationary" object via a logarithmic transfor- 
mation. Roughly speaking, we introduce a new tree which has the same binary branching 




Figure 2: A large reduced tree and the distribution of the harmonic measure on its 
boundary. Clearly the measure is not equally spread and exhibits a fractal behavior. 



structure as A, such that each point of A at height s G [0,1) corresponds to a point of the 
new tree at height — log(l — s) £ [0, oo). The resulting non-compact tree is called the Yule 
tree because it describes the genealogy of the classical Yule process, where individuals have 
(independent) exponential lifetimes with parameter 1 and each individual has exactly two 
offspring. We define the boundary of the Yule tree as the collection of all its geodesic rays, 
where a geodesic ray is just a semi-infinite geodesic path starting from the root. This bound- 
ary is easily identified with d A. An application of Ito's formula shows that the logarithmic 
transformation turns Brownian motion on A into a time-changed Brownian motion with drift 
1/2 on the Yule tree. Consequently, the probability measure \x corresponds via the preced- 
ing transformation to the distribution v of the geodesic ray that is "selected" by Brownian 
motion with drift 1/2 (that is, the unique ray of the Yule tree that is visited by Brownian 
motion at arbitrarily large times) . The first assertion of Theorem p] is then equivalent to 
proving that, P a.s., v(dy) a.e., 



lim - log^(£>(y,r)) 

r— >oo T 



(1) 



where B(y, r) denotes the set of all geodesic rays of the Yule tree that coincide with y up to 
time r. 

The next step is then to identify a kind of "stationary environment seen from the particle" 
for Brownian motion on the Yule tree. More precisely, we show in Section [3TT] that the law of 
the subtree above level r > that is selected by Brownian motion (with drift 1/2) converges 
as r — > oo to a limiting probability measure that we explicitly describe. This allows us 
to construct an ergodic invariant measure for the natural shifts on the space of all pairs 
consisting of a (deterministic) Yule-type tree and a distinguished geodesic ray on this tree, 
and moreover this measure is absolutely continuous with respect to the law of the random 
pair formed by the Yule tree and the ray selected by Brownian motion. The limiting result 
(fTl then follows from an application of Birkhoff's ergodic theorem to a suitable functional. 
In this part of our work, we use several ideas that have been developed by Lyons, Pemantle 
and Peres 18J in a slightly different setting. 

The random conductance. The constant /? in Theorems [I] and [2] can be expressed 
in terms of the (continuous) conductance of A. Roughly speaking, if one considers A as 
a network of resistors with unit resistance per unit length, then the effective resistance 
between height and height 1 is a random variable, which we denote by C. With this 
interpretation, it is clear that C > 1 a.s. Alternatively, C is the mass under the Brownian 
excursion measure from the root of those excursion paths that hit height 1. Note that C 



is also the limit in distribution of the (scaled) conductance between generations and n in 
T'"'. The distribution of C satisfies the following recursive equation 



C ® 



u- 



1-U x- 1 



Ci + C 2 



(2) 



where C\ and Ci are independent copies of C, and U is uniformly distributed over [0, 1] and 
independent of the pair (C\,C2)- Despite this rather simple recursive equation, the law "f(ds) 
of C is not completely understood (in particular, its mean is unknown). We prove that, 
although 7 has a continuous density / over [l,oo), the function / is not twice continuously 
differentiablc at the point 2 (and we expect a similar singular behavior at all integer values) . 
See Figure [3] 




Figure 3: A histogram of the distribution of 7 over (l,oo) from simulations based on 
the recursive equation (pi). There are explicit formulas for the density of 7 over [1,2] 
and over [2,3], which however depend on the (unknown) density at 1. The red and the 
blue curves correspond to these explicit formulas, with a numerical approximation of the 
density at 1. 



In many respects, the distribution 7 governs the behavior of the harmonic measure, 
particular the constant j3 has an explicit expression in terms of 7. 
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Proposition 3. The distribution 7 is characterized in the class of all probability measures 
on [l,oo) by the distributional equation Q2J) . The constant j3 appearing in Theoremsujandffl 

is given by 
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The paper is organized as follows. We start by studying the continuous model. In 
Section[2]we introduce the basic set-up and we relate the random tree A to the Yule tree. The 
law of the random conductance C is studied in Section [273] Section [3] gathers the ingredients 
of the proof of Theorem [2] In particular, Section |3~2"1 idcntifies the limiting distribution of the 
subtree above level r selected by Brownian motion, and Section [3.3| explains the application of 
the ergodic theorem needed to derive (111). Section Elis devoted to the proof of Theorem HI Let 
us emphasize that Theorem [T] is not a straightforward consequence of Theorem [2] and that 
the proof of our results in the discrete setting requires a number of additional estimates, even 



though a key role is played by Theorem[2] The last section is devoted to a few complements. 
In particular, we comment on the connection between the present paper and the recent work 
of Ai'dekon pQ. 

Acknowledgments. We would like to thank Thordur Jonsson for suggesting the study of 
the harmonic measure on critical Galton-Watson trees during Spring 2012. 

2 The continuous setting 

In this section we give a formal definition of the (continuous) reduced tree A. We then 
explain the connection between the reduced tree and the Yule tree. We finally introduce and 
study the conductance of these trees, which plays a key role in the next sections. 

2.1 The reduced tree A 

We set 

oo 

v=(J{MF 

where {1,2}° = {0}. If v = (vi, . . . ,v n ) € V, we set |u| = n (in particular, |0| = 0), and 
if n > 1, we define the parent of v as v — («i, . . . , w n -i) ( we then say that v is a child of 
v). If v = (vi, . . . , v n ) and v' — (v[, . . . , v' m ) belong to V, the concatenation of v and v' is 
vv' := (vi, . . . , v n , v'i, . . . , v' m ). The notions of a descendant and an ancestor of an clement 
of V are defined in the obvious way, with the convention that a vertex v £ V is both an 
ancestor and a descendant of itself. liv,w£V,v/\wis the unique element of V that is an 
ancestor of both v and w and such that \v A w\ is maximal. 
We then consider a collection 

of independent real random variables uniformly distributed over [0, 1] under the probability 
measure P. We set 

Y = U 

and then by induction, for every v £ {1, 2}™, with n > 1, 

Y v =Y ij + U v {l-Y % ). 
Note that < Y v < 1 for every v £ V, a.s. Consider then the set 

A := {0} x [O,F ] U |J {v} x (n,n]. 
nev\{0) 

There is a straightforward way to define a metric d on Ao, so that (Ao, d) is a (noncompact) 
K-tree and, for every x — (v,r) £ Ao, we have d((0,O),x) = r. To be specific, let x — 
[y, r) £ A and y = (w, r') £ A : 

• If v is a descendant of w or w is a descendant of v, we set d(a;, y) = \r — r'\. 

• Otherwise, d(x,y) = d((vAw,Y vAw ),x)+d((vAw,Y vAw ),y) = (r-Y vAw ) + (r' -Y vAw ). 
See Figure |4] for an illustration of the tree Ao. 

We let A be the completion of Ao with respect to the metric d. Then 

A = A U dA 

where by definition dA = {x £ A : d((0,O),x) = 1}, which is canonically identified with 
{1,2} N . Note that (A,d) is a compact E-tree. 
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Figure 4: The random tree Ac 



The point (0, 0) is called the root of A. For every x £ A, we set H(x) = d((0, 0), x) and 
call H (x) the height of x. We can define a genealogical order on A by setting x -< y if and 
only if x belongs to the geodesic path from the root to y. 

For every e £ (0, 1), we set 

A £ = {ieA: H(x) < 1-e}, 

which is also a compact M-tree for the metric d. The leaves of A e are the points of the form 
(v, 1 — e) for all v £ V such that Y$ < 1 — e < Y v . The branching points of A e are the points 
of the form (v,Y v ) for all v £ V such that Y v < 1 — e. We can then define Brownian motion 
on A £ as a special case of a diffusion on a graph (see in particular [TD] , [8] and the references 
therein) . Informally, this process behaves like linear Brownian motion as long as it stays on 
an "open interval" of the form {v} x (Yy,Y v A (1 — e)). It is reflected at the root (0,0) and 
at the leaves of A e , and when it arrives at a branching point of the tree, it chooses each of 
the three possible line segments ending at this point with equal probabilities. 

Write B e = (i?^) f > for Brownian motion on A £ starting from the root, and let 



T £ := inf{i > : H{B £ t ) = 1 - s}, 

be the hitting time of the set of all leaves of A e . The process B e is defined under another 
probability measure P (for our purposes, it will be important to carefully distinguish the 
probability measure P governing the random trees and the one governing Brownian motions 
on these trees). 

If we now set e„ = 2~™ for every n > 1, we may define all processes B 6n on the same 



probability space, in such a way that B 



t/\T E , 



B 



tAT., 



for every t > and every choice of 



m < n, P a.s. Assuming that the latter property holds, we set 



T = lim t T En 

and we define the process (B t ) t >o by requiring that _B t = ^ if i > T (where £ is a cemetery 

— B^ T . It is easy to verify that the left 



point) and, for every n > 1 and t > 0, B t ^r c 
limit 



lim B t 

t^T,t<T 



exists in A and belongs to dA, P a.s. The harmonic measure // is the distribution of Bl- 
under P, which is a (random) probability measure on dA = {1, 2} N . 



2.2 The Yule tree 

For the proof of Theorem [21 it will be more convenient to reformulate the problem in terms 
of Brownian motion on the Yule tree. To define the Yule tree, consider now a collection 

(Vv)vev 

of independent real random variables exponentially distributed with mean 1 under the prob- 
ability measure P. We set 

Z = Vgs 

and then by induction, for every v € {1, 2}", with n > 1, 

Z v = Z v + V v . 
The Yule tree is the set 

T := {0} x [0, Z ] U |J {v} x (Z t ,Z v ], 

v<EV\{0} 

and is equipped with the metric d defined in the same way as d in the preceding section. For 
this metric, T is again a non-compact K-tree. For every x — (v, r) € T, we keep the notation 
H(x) = r = d((0, 0),x) for the height of the point x. 

Now observe that if U is uniformly distributed over [0, 1], the random variable — log(l — U) 
is exponentially distributed with mean 1 . Hence we may and will suppose that the collection 
(^ti)uev is constructed from the collection (U v ) ve \> in the previous section via the formula 
V v = — log(l — U v ), for every v e V. Then, the mapping "J defined on A by ^(v,r) = 
(t>, — log(l — r)), for every (u, r) e Ao, is a homeomorphism from Ao onto T. 

Stochastic calculus shows that we can write, for every t € [0,T), 

*(B t ) = w( J (1 - H(B S ))- 2 ds) (4) 

where (W(t)) t >o is Brownian motion with constant drift 1/2 (towards infinity) on the Yule 
tree (this process is defined in a similar way as Brownian motion on A e , except that it 
behaves like Brownian motion with drift 1/2 on every "open interval" of the tree). Note that 
again W is defined under the probability measure P. From now on, when we speak about 
Brownian motion on the Yule tree or on other similar trees, we will always mean Brownian 
motion with drift 1/2 towards infinity. 

By definition, the boundary of T is the set of all all infinite geodesies in T starting from 
the root (0,0) (these are called geodesic rays). The boundary of T is canonically identified 
with {1, 2} N . From the transience of Brownian motion on T, there is an a.s. unique geodesic 
ray denoted by W^, that is visited by (W(t),t > 0) at arbitrarily large times. We sometimes 
say that Woo is the exit ray of Brownian motion on T. The distribution of W^ under P yields 
a probability measure v on {1, 2} N . Thanks to Q, we have in fact v = /i, provided we view 
both jjl and v as (random) probability measures on {1, 2} N . The statement of Theorem pi is 
then reduced to checking that (fll holds v(dy) a.e., P a.s. 

Yule-type trees. Our proof of (111) makes a heavy use of tools of ergodic theory applied to 
certain transformations on a space of trees that we now describe. We let T be the set of all 
collections (z v ) v£ \> of positive real numbers such that the following properties hold: 

(i) Zy < z v for every v £ V\{0}; 

(ii) for every v= (v 1 ,v 2 ,---) e {1,2} N , 

lim Z( vlt ... iVn ) = +oo. 



Notice that we allow the possibility that z = 0. We equip T with the ct- field generated by 
the coordinate mappings. If (z v ) veV e T, we can consider the associated "tree" 

T:={0}x [0,z z ]U (J {i)}x(zo,4 
»ev\{0} 

equipped with the distance defined as above. We will keep the notation H (x) = r if x = (v, r) 
for the height of a point x G T . The genealogical order on T is defined as previously and 
will again be denoted by -<. If u = (u 1; u 2 , • ■ ■ , u n , . . .) e {1, 2} N , and a; = (v, r) e T, we 
write x -< u if w = (ui, U2, ■ ■ ■ , Ufe) for some integer fc > 0. 

We will often abuse notation and say that we consider a tree Tel: This really means 
that we are given a collection (z„)„ 6 y satisfying the above properties, and we consider the 
associated tree T ■ In particular T has an order structure (in addition to the genealogical 
partial order) given by the lexicographical order on V. Elements of T will be called Yule-type 
trees. 

Clearly, the Yule tree can be viewed as a random variable with values in T, and we write 
O(dT) for its distribution. 

Let us fix T G T. If r > 0, the level set at height r is 

T r = {xeT: H{x) = r}. 

If x G %., we can consider the subtree T[x\ of descendants of x in T. Formally, we view 
T[x\ as an element of T: We write v x for the unique element of V such that x = (v x , r), and 
define T[x] as the Yule-type tree corresponding to the collection {z VtV — r) ve y. Similarly, 
if [[0,x]] denotes the geodesic segment between the root and x, we can define the subtrees 
of T branching off [[0, x]]. To this end, let n x — \v x \ and let v x .o — 0, v x> \, . . . , v x , n ^ = v x 
be the successive ancestors of v x from generation to generation n x . Set r xi — z Vx i _ 1 for 
every 1 < i < n x . Then, for every 1 < i < n x , the z-th subtree branching off [[0,a;]], which is 
denoted by T x .i, corresponds to the collection 



\^Vx,iV x,i) 



vev 



where v x _i is the child of v x ,i—i that is not v x ,i- To simplify notation, we introduce the point 
measure 

Tl x 

Cr,x(T) = y]S(r x , t ,T x ,i), 
i=l 

which belongs to the set A / I P (IR+ x T) of all finite point measures on M + x T. 

T[x] 
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Figure 5: The spine decomposition 



We now state a "spine" decomposition of the Yule tree, which plays an important role in 
our approach. 



Proposition 4 (Spine decomposition). Let F be a nonnegative measurable junction on T, 
and let G be a nonnegative measurable function on JA P (M. + x T). Let r > 0. Then, 



3 



J2F(T[x])G(^ x (T)) 



xer r 



e r E[F(r)] xE[G(A0], 



where J\f(ds dT) is, under the probability measure P, a Poisson point measure on M P (R + x T) 
with intensity 2 l[o,r] ( s ) ds O(rfT) . 

This result is part of the folklore of the subject, and we leave the proof to the reader. 
Note that when F = 1 and G = 1, we recover the well-known property E[#r. r ] = e r . 

2.3 The continuous conductance 

Before we proceed to the proof of Theorem [2l we will define and study the continuous 
conductance C of the tree A, which plays a major role in this proof. Informally, the random 
variable C is defined by viewing the random tree A as a network of ideal resistors with unit 
resistance per unit of length and letting C be the conductance between the root and the 
set DA in this network. We will give a more formal definition using excursion measures of 
Brownian motion. To this end, and in view of further applications in the next section, we 
first define the excursion measure on a (deterministic) Yule-type tree. 

So let TeT, and consider the associated collection (z v ) ve \> as explained in the preceding 
section. For simplicity, we suppose here that z > 0. For every e G (0, z®), there is a unique 
x £ G T whose distance from the root is equal to e. Let n-y. e be the law of Brownian 
motion on T with drift 1/2 started from x e and stopped when it hits the root. Note that 
nj- e is a probability measure on the space C(M + ,7 _ ) of all continuous functions from R + 
into T. Then, it is easy to verify that the measures £^ 1 nq-^ converge when e — > 0, in 
an appropriate sense, towards a er-finite measure on C(R+,T), which is denoted by n-j- 
and called the excursion measure of Brownian motion (with drift 1/2) on the tree T . For 
uj G (7(11+, T), set T (uj) — inf{£ > : uj(t) = (0,0)}, with the usual convention inf = oo. 
The conductance C(T) is then defined by 

C{T) = n-j-(T = oo) = lim e~ 1 n-j- E (T = oo). 

Note that we have 1 < C(l~) < oo (using the form of the scale function of Brownian motion 
with drift, we immediately see that C(T) < (1 — e _Z0 ) _1 ). 

To simplify notation, we set C — C(T), which is a random variable with values in [1, oo). 
Because of the relations between the Yule tree T and the reduced tree A, the random con- 
ductance C may also be defined as the mass assigned by the excursion measure of Brownian 
motion on A (away from the root), to the set of trajectories that reach height 1 before coming 
back to the root. 

The distributional identity ([2]) is then obvious from the electric network interpretation: 
just view A as a series of two conductors, the first one being a segment of length U and 
the second one consisting of two independent copies of A (scaled by the factor 1 — U) in 
parallel. Alternatively, it is also easy to derive (pi from the probabilistic definition in terms 
of excursion measures, by applying the strong Markov property at the hitting time of the 
first node of the tree. We leave the details to the reader. 

Let us now prove that p| characterizes the law of C and discuss some of the properties 
of this law. For u £ (0, 1) and x, y > 1, we define 



G(u,x,y) := [u-\ ■ — , (5) 

V x + yj 
so that (pi can be rewritten as 

C^GO/A,^) (6) 
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where U,C\,C 2 are as in pi. Let ^# be the set of all probability measures on [l,oo] and let 
$ : ./# — > ,<# map a distribution a to 

*(tr) = Law(G(C/,X 1 ,X 2 )) 

where X\ and X2 are independent and distributed according to a and U is uniformly dis- 
tributed over [0, 1] and independent of the pair [Xi,X2)- 

Proposition 5. The law 7 of C is the unique fixed point of the mapping $ on ^K , and we 
have 3> fc (cr) — > 7 weakly as k — » 00, for every a £ ^#. Furthermore all moments 0/7 are 
finite, and 7 /ias a continuous density over [l,oo). Finally, the Laplace transform 

/>oo 

(p(£) = E[exp(-tC/2)] = / e~ er/2 7 (dr), £ > 



solves the differential equation 

2£ ip" (£) + V (£) + <p 2 (*) - <^) = 0. (7) 

Remark. In [T5] the authors discuss the conductance of an infinite super-critical Galton- 
Watson tree with offspring distribution 9. This conductance also satisfies a recursive dis- 
tributional equation, which depends on 9. One conjectures that the distribution of the 
conductance is absolutely continuous with respect to Lebesgue measure at least if 9(k) — 
for all sufficiently large k, see [TT1I2T)] . 

Proof. We start with a few preliminary observations. If a, a' £ ^# we say that a random 
pair (X, Y) is a coupling of a and a' if X is distributed according to a and Y is distributed 
according to a' . The stochastic partial order H on «/# is defined by saying that a -< a' if and 
only if there exists a coupling (X, Y) of a and a' such that X < Y a.s. It is then clear that 
the mapping $ is increasing for the stochastic partial order. 

We endow the set ^#1 of all probability measures on [1, 00] that have a finite first moment 
with the 1-Wasserstein metric 

dxto-,0-') = mf [E[\X-Y\] : (X,Y) coupling of {a, a')}. 

The metric space (^#i,di) is Polish and its topology is finer than the weak topology on 
j#\. From the easy bound G(u, X, y) < x + y, we immediately see that $ maps ^#1 into 
jft\. We then observe that the mapping <& is strictly contractant on ^(\. To see this, let 
[X\,Y\) and (X2, Y2) be two independent copies of a coupling between a, a' £ ^\ and let 
U be uniformly distributed over [0,1] and independent of (X\, Y\,X 2 , Y2). Then the two 
variables G(U,Xi,X 2 ) and G(U, Y\,Y 2 ) give a coupling of &(cr) and &(cr'). Using the fact 
that X u Y x , X 2 , Y 2 > 1 we have 

\G{U,X 1 ,X 2 )-G(U,Y 1 ,Y a )\ = (U+ } r ~ U „ y l -(U [ ~ l ' 



X X +X 2 J V Y x +Y 2 

(X 1 + X 2 -Y 1 -Y 2 )(l-U) 



(U(Xi + X 2 ) + 1- U){U{Y X + Y 2 ) + 1 - U) 
< (l^-m + ^-^l)^-^. 

Taking expected values and minimizing over the choice of the coupling between a and a', 
we get di(i>(o-),$(cr')) < 2(1 - log(2))di(<r,<r')- Since 2 ( l ~ log(2)) < 1, the mapping $ is 
contractant on ^#l and by completeness it has a unique fixed point 70 in dt\. Furthermore, 
for every a £ ^#1, we have & k (a) — > 70 for the metric di, hence also weakly, as k — > 00. 

Since we know from (T6J) that 7 is also a fixed point of <&, the equality 7 — 70 will follow if 
we can verify that 70 is the unique fixed point of $ in ^#. To this end, it will be enough to 
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verify that we have <J> fc (er) — > 70 as k — > 00, for every a £ ^ . Let a <G ^ and for every t G K 
set -F CT (i) = cr([t, 00]). Also set F^ ^(i) = P(^! + X 2 > t) where X 1 and A 2 are independent 
and distributed according to a. Then we have, for every t > 1, 

- P M7 < i" 1 and ^— ^ < X x + X, 



It follows that, for every t > 1, 



I 1 \ 1 — ui 



/_1 ^ ''■'' ^J 2) (*). (8) 



< 7* (s-1) 



, N F^ 2) (t) 2FM/2) 

**(«r)(*) < -^-^ < a \' ' ■ (9) 

We apply this to a = <J>(#oo), where S^ is the Dirac measure at 00. We have Fg^fJ;) = i _1 , 
and it follows that, for every t > 1, 



P* 2 (5oo)W 



< 
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This implies that <I> 2 (<5oo) S -^l- By monotonicity, we have also & 2 (cr) € ~<#l for every 
ct € ./#, and from the preceding results we get Q k (a) — > 70 for every a £ .<#. As explained 
above this implies that 7 = 70 is the unique fixed point of <& in ^ . 

Let us now check that all moments of 7 are finite. To simplify notation we write F — F^ 
and F (2) = F!f\ By (|8) we have for every t > 1, 

which implies that F(i) < 2F(t/2)/t for every £ > 1, by the same argument as above. Iterat- 
ing this inequality, we get that F(t) < c\ exp(— C2(logt) 2 ), with certain constants C\,ci > 0. 
It follows that all moments of 7 are finite. 

By construction, we have F^ 2 \t) — 1 for every i g [1,2]. It then immediately follows 



from (10) that we have 



F(t)= K ^ + l-K , We [1,2], (11) 



where 



Then we observe that the right-hand side of (10) is a continuous function of t € (l,oo), so 



that F is continuous on [1, 00) (the right-continuity at 1 is obvious from (111). Thus 7 has no 
atoms and it follows that the function F^ is also continuous on [1, 00) . Using ( 10 1 again we 
obtain that F is continuously differentiable on [l,oo) and consequently 7 has a continuous 
density / = —F' with respect to Lebesgue measure on [l,oo). By (111, /(£) = Kot~ 2 for 
t G [1, 2] and in particular /(l) = Kq. 



Let us finally derive the differential equation (J7j . To this end, we first differentiate (10) 
with respect to t to get that the linear differential equation 

t{t-l)F'{t)-F{t) = -F^(t). (12) 
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holds for t £ [1, oo). Then let g : [1, oo) — > R+ be a continuously differentiable function such 
that g(x) and g'(x) are both o(x a ) when i4oo, for some a £ (0, oo). From the definition 
of F and Fubini's theorem, we have 



dtg'(t)F(t)=E[g(C)}-g(l) 
J\ 

and similarly 

/OO 
dtg'{t)F^\t)^n9{Ci+C 2 )]-g{l) 

where C\ and C 2 are independent copies of C under the probability P. We then multiply both 



sides of (12 1 by g'(t) and integrate for t running from 1 to oo to get 

Epitd-iytCOREfotd)] = E[.9(Ci+C 2 )]. (13) 

When g(x) = exp(—x£/2) for £ > 0, this readily gives ^f\. D 



Remark. We may also take g{x) = x m for m £ {1,2,3,4,...} in (13). This leads to 
recursive formulas for the moments of C in terms of the first moment E[C] (simulations give 
E[C] « 1.72). 



Singular behavior of the density of 7. By (Ulk, the values of F and / = — F' on the 
interval [1,2] are determined by the constant Kq = /(l). We have not been able to obtain 
an exact numerical value for Kq, but simulations indicate that Kq w 1.47 (see Figure [3]). We 
may now observe that the values of F over [1,2] determine the values of F^ 2 ' over [2, 3], via 
the formula 

l-F^(t)=J dsf(s)(l-F(t-s)), Vie [2,3]. 



We can then use either (TL2]) or ( 10 1 to get a complicated explicit expression for F over [2,3], 
again in terms of Kq. By iterating the argument, we can in principle determine F by solving 
linear differential equations on the successive intervals [n,n+l],n=l,2,.... Unfortunately, 
the calculations become tedious and we have not been able to find a closed expression for 
F(t). However, from the expressions found for the first two intervals [1, 2] and [2, 3], one can 
verify that, although the function / is continuously differentiable on (1,3), one has 

r(2-) = 3 f, w het e. s /»(2 + )=^_M, 
so that / is not twice differentiable at the point 2. See the inflection point at 2 on Figure [3] 

2.4 The flow property of harmonic measure 

In this section we establish a property of harmonic measure that plays an important role in 
the proof of Theorem [2] This property is well known in the discrete setting, but perhaps less 
standard in the continuous setting, and we sketch a short proof. 

We fix a Yule- type tree T £ T. In this section only, we slightly abuse notation by writing 
W = (W t )t>o f° r Brownian motion with drift 1/2 on T started from the root. As previously, 
Woo is the exit ray of W, and the distribution of Woo is the harmonic measure of T. For 
every r > 0, if x is the unique point of %■ such that x -< Woo, we write Woo for the ray of 
T[x] that is obtained by shifting Woo at time r. 

Lemma 6. Let r > and x £ T r . Conditionally on {x -< Woo}> the law of Woo is the 
harmonic measure ofT[x\. 
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Proof. For simplicity, we suppose that x is not a branching point of T, and then we can 
choose e > sufficiently small so that there is a unique descendant x e of x in T at distance e 
from x. Clearly the harmonic measure of T[x] can be obtained by considering the distribution 
of the exit ray of Brownian motion started from x £ and conditioned never to hit x. On the 
other hand, by considering the successive passage times at x £ , we can also verify that the 

(r) 

conditional law of Wso knowing that x -< W^ corresponds to the same distribution. We 
leave the details to the reader. □ 

3 Proof of Theorem [2] 

Let us outline the main steps of the proof of Theorem [2] Proposition [7] below uses the spine 
decomposition (Proposition El and the Ray-Knight theorem for local times of Brownian 
motion with drift to determine the exact distribution of the subtree of the Yule tree above 



level r that is selected by harmonic measure. In Section 3.2 we use stochastic calculus 
to prove that this law converges as r —> oo to an explicit distribution, which is absolutely 
continuous with respect to (Corollary [To|) . In the last two subsections, we rely on arguments 
of ergodic theory, mainly inspired by [18J, to complete the proof of Theorem [2] 

We recall that P stands for the probability measure under which the Yule tree is defined, 
whereas Brownian motion (with drift 1/2) on the Yule tree is defined under the probability 
measure P. 

3.1 The subtree above level r selected by harmonic measure 

In this subsection, we fix r > 0. We will implicitly use the fact that T has a.s. no branching 
point at height r. 

There is a unique point x € T r such that x < Woo, and we set r( r ) = T[x], which is the 
subtree above level r selected by harmonic measure. We are interested in the distribution of 
r( r ). Let F be a nonnegative measurable function on T, and consider the quantity 



I r :=E(g).E[F(r (r) )] =E®E 



J2 F(T[x])l {x ^ Woo} 
igr r 



(14) 



where the notation E E means that we consider the expectation first under the probability 
measure P (under which the Brownian motion W is defined) and then under P. We will use 
Proposition HJ to evaluate I r . 

Let us fix x G r,. and R > r. We will use the notation T[x] := {y e T : x -< y}. This is 
just the set of all descendants of x in T, now viewed as a subset of T and not as a Yule-type 
tree as in the definition of T[x]. Define 



n,fl 



{yeT\T[x]:H(y)<R}UT[x}. 



Let W X,R be Brownian motion (with drift 1/2) on T X ' R (we assume that W X,R is reflected 
both at the root and at the leaves of T X,R , which are the points y of r\r[x] such that 
H(y) = R). We look for an expression of the probability that W X ' R never hits the leaves of 
Y X * R J or equivalcntly that W X,R escapes to infinity in r[x] before hitting any leaf of T X ' R . 
Write (£*' )t>o for the local time process of W X ' R at x. Note that we use here the standard 
normalization of local time as an occupation time density. With this normalization, £ x ' i s 
the a.s. limit as e — > of the quantities 2eN x ' e , where N x ' £ is the number of "up-crossings" 
of W X ' R from x to the point x e € V such that x -< x e and d(x, x s ) = e (this point is unique 
for e small) before time t. We claim that £^ R has an exponential distribution with parameter 
C(r[x])/2. This follows from excursion theory, but an elementary argument can be given as 
follows. Each time W X,R does an up-crossing from x to x e , there is a probability of order 
eC(r[a;]) that it escapes to infinity before coming back to x (by the very definition of C(r[a;])). 
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Hence the total number of up-crossings from x to x £ before escaping to infinity is geometric 
with parameter of order eC(r[x]), and our claim follows from the approximation of local time 
by up-crossing numbers. 

We then consider for every a £ [0,r] the local time process (Z"' )t>o of W X,R at the 
unique point of JO, x]] at distance a from the root. Note in particular that L r ' 
As a consequence of a classical Ray- Knight theorem, conditionally on I; 



rix.R 



x,R _ 



(L^q aR )o< a <r is distributed as the process (X a )o< a < r 
equation 

dX a = VZX^drja + (2 - X a )da 

X = l 



the process 
which solves the stochastic differential 



(15) 



where (j] a ) a >Q is a standard linear Brownian motion. Note that we are here using a version 
of the Ray- Knight theorem for Brownian motion with drift (see e.g. |lj, p. 93), which however 
follows from the usual version by simple scale and time change transformations. Further- 
more, we used the independence of excursions of W x ' "below" and "above" x to get that 
conditionally on i x ^ — £ the collection (L^, fl ) < a <r has the same distribution as the local 
times of a Brownian motion with drift 1/2 on the interval [0, r] (reflected at both ends of this 
interval) started from and stopped at the first time when its local time at r hits the value 
i. In what follows, we will write Pg for the probability measure under which the process X 
starts from I, and Pi c \ for the probability measure under which the process X starts with 
an exponential distribution with parameter c/2. 

Now write Xj, 1 < j < k for the branching points of T X ' R (or equivalently of T) that 
belong to [[0,a;]], and set aj = H(xj) for 1 < j < k. Also let 7 x ,j.R be the subtree of T X ' R 
that branches off [[0, x}} at Xj. Let A Xt R be the event that W X,R never hits the leaves of T X ' R . 
Excursion theory shows that the conditional probability of A^r knowing (L^) < a <r is 



cxpf-^ctr,^)^) 



i=i 



where C{T x ^_r) refers to the conductance of T x ,j,R between its root Xj and the set of its leaves 
(this conductance is defined by an easy adaptation of the considerations of the beginning of 
Section |2.3h . 



From the preceding observations, we have thus 

k 



P{A x ,r) 



E 



1 



exp -.E^^)^ 



j=i 



E, 



(C(T[x])) 



1 
eXP ( ~ 2^ C ( T x,J,R) X r-a.i) 



i=l 



(16) 



At this point, we let R tend to infinity. It is easy to verify that P(A x r) increases to P(A X ), 

where A x — {x -< IFoo}- Furthermore, for every j £ {l,...,fc}, C(T x ,j,r) decreases to 

C(T x j), where r x j is the subtree of T branching off [[0, x\] at Xj. Consequently, we obtain 

that 

1 k 
P(x -< Woo) = E {c{r[x])) exp ^ - - ^2 C ( r x,j) A r _ Qj 



We can now return to the computation of the quantity I r defined in (14 1. We get 

/ r = E[^f(r[i])P(^^ 



x er r 



e[ J2 F (nx\)E t 



(C(r[x])) 



eXP ( ~ 9 ^2 C ( Tx ^ X r-a 3 



X £T r 



J'=l 
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Note that the quantity inside the sum over x £ T r is a function of T[x] and of the subtrees 
of T branching off the segment [[0,x]]. We can thus apply Proposition [4] and we get 



I r = e r / Q(dT) F(T) E 



E, 



(C(T)) 



exp 



Nr{dadT')C{T')X r _ a 



where under the probability measure P, N r (dadT') is a Poisson point measure on [0,r] x T 
with intensity 2daO(dl~ / ). We can interchange the expectation under P and the one under 
-P(C(T))i an d using the exponential formula for Poisson measures, we arrive at 



I r = e r J e(dT)F(T)E ic{T)) [exp-2j da(l-<p(X a )) 
where we recall that for every s > 0, 

<p{8) = E[exp(-sC/2)] = ef exp(-sC(T)/2) 



(17) 



is the Laplace transform (evaluated at s/2) of the distribution of the conductance of the Yule 
tree. We have thus proved the following proposition. 

Proposition 7. The distribution under P (g) P of the subtree Y^ r > has a density with respect 
to the law Q{dT) of the Yule tree, which is given by $ r (C(T)), where, for every c > 0, 



$ r (c) = E (c) 



exp- / da{\-2<p{X a )) 
o 



3.2 Asymptotics 

In this section, we study the asymptotic behavior of <Iv(c) when r tends to oo. We first 
observe that, in terms of the law 'y(ds) of C(T), we have 



<P& 



-^/ 2 7 (d S ), v'{C) 



l,oo) 



-es/2 



[l,oo) 



j(ds). 



It follows that ip{£) < e- { / 2 and \tp'(£)\ < |(/ S7(ds))e" £/2 . By differentiating Q, we have 
also 

2£ (p'"(£) + (2 + l)<p"(l) + 2<p(£)tp'(e) = 0. (18) 



Our main tool is the next proposition. 
Proposition 8. For every £ > 0, 



lim Et 



exp- 



da(l-2<p(X a )) 



^'(£)e f / 2 
f™ds<p'(s) 2 e s / 2 ' 



Additionally, there exists a constant K < oo such that, for every £ > and r > 0, 



Ef 



exp- 



da(l-2<p(X a )) 



< K. 



Proof. Under the probability measure P<c(T)) the process X starts with an initial distribution 
which is exponential with parameter C(T)/2. Consequently, under J Q{dT) P(ecr))-> the 
initial density of X is 

q(£)= f 7 (ds) S -e- si / 2 = -?'(£). (19) 

Jll.oo) Z 
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However, from (17 1 with F = 1, we have 



1 = J ®{dT)E {C{T)) 
= - I ' dt<p'{l)Ei 



cxp- / da(l-2ip(X a )) 

o 

rs;> - j da(l-2tp(X a )) 
o 



We can generalize the last identity via a minor extension of the calculations of the pre- 
ceding section. We let L° x be the total local time accumulated by the process W at the root 
of r. Let r > and let h be a bounded nonnegative continuous function on (0,oo), and 
instead of the quantity I r of the preceding section, set 



E&E 



h(Ll) "£F(T[x])l {x 



^Woo} 



xer r 



where F is a given nonnegative measurable function on T. The same calculations that led 



to ( 16 1 give, for every ier r and R> r, 

1 k 
h{L° 00 R )^(--Y,C{T Xtj , R )L a J > 



E[h{I%?)l FmtR ] = E 



i=i 



E, 



(C (!>])) 



1 k 

h(X r ) exp ( - ^y^ y C(T XtjjR ) X r ^ a3 



3 = 1 



(20) 



When R — > oo, L°^ R converges to L^,, and so we get 



E [H L< L) 1 {x^,w 00 }] = E (C{r[x])) h(X r ) exp ( - - ^C(r xJ ) X r _ a . 



j=i 



We then sum over x € T r and integrate with respect to P. By the same manipulations as in 
the preceding section, we arrive at 



I 1 ; = e r I Q(dT) F(T) E (C{T)) \h(X r ) cxp -2 / da (1 - <p(X a )) 



(21) 



Note that if F = 1, 



I^ = E®E[h(L° x )} 



<p'(£)h(£)d£ 



since given T = T the local time L^ follows an exponcnticl distribution with parameter 



C(T)/2, and we use the same calculation as in (19l. Hence the case F = 1 of (21 1 gives 

d£ <p' (£) Et h(X r ) exp -/ da(l-2y(X a )) = d£<p'(£) h{£). (22) 

o L Jo ^ Jo 

By an obvious truncation argument, this identity also holds if h is unbounded. 
At this point, we need a lemma. 

Lemma 9. The process 

M a := -<p'(X a ) exp (^ - / da (1 - M**))) 

is a martingale under Pi, for every £ > 0. 
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Proof of Lemma [9l From the stochastic differential equation (151, an application of Ito's 
formula shows that the finite variation part of the semimartingale — M a is 



(2X s y'"{X s ) + (2 + X s y{X s ) + 2ip{X,)<f?{X a )) exp (-£■-[ du(l- 2 ¥ >(.X U ))) ds 



and this vanishes thanks to (18). Hence M is a local martingale. Furthermore, we already 



noticed that, for every I > 0, \y'{£)\ < Cer 1 / 2 , where C := \j sj(ds). It follows that \M\ 
is bounded by C e a over the time interval [0,a], and thus M is a (true) martingale. □ 



We return to the proof of Proposition [8] Let £ > and t > 0. On the probability space 
where X is defined, we introduce a new probability measure Q\ by setting 

Note that the fact that Q\ is a probability measure follows from the martingale property 
derived in Lemma [9] Furthermore, we have Pg a.s. 



M t _ tp'(Xt) (Xt_ 
M ~ <?{£) 6XP 1 5 



ds(l-2<p{X a )) 



so that the martingale part of log H 1 is 

<* <p"{X s ) 



JX S dr] s + 2 / \/X s dr] s , 

Jo <P{Xs) 



where r\ is the linear Brownian motion in (15 1. An application of Girsanov's theorem shows 
that the process 



Vs ■= Vs 



'x,M 



2<p"(X u 



( 1 + f ^f)"- »^^ 



is a linear Brownian motion over the time interval [0,t], under Q\. Furthermore, still on the 
time interval [0,£], the process X satisfies the stochastic differential equation 

dX s = 2^/Y s dij s + 2X S ( 1 + ^Vv ) ds + (2 - X s )ds, 



<P'(X S ) 



or equivalently, using (m|, 



dX s = 2^/A^ d?j s + (2 - X s + 2 ^ f (X s )\ ds. 



(23) 



Notice that the function 



^^L(t) 



is continuously differentiate over [0, 00), takes negative values on (0, 00) and vanishes at 0. 
Pathwisc uniqueness, and therefore also weak uniqueness, holds for (23 1 by an application 



of the classical Yamada-Watanabe criterion. The preceding considerations show that, under 
the probability measure Q\ and on the time interval [0,i], the process X is distributed as 
the diffusion process on [0, 00) with generator 



d 2 ( n 


, ^-V 2 r \\ d 
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started from £. Write X for this diffusion process, and assume that X starts from t under 
the probability measure Pg. Note that is an entrance point for X, but, independently of 
its starting point, X does not visit at a positive time (in fact this follows from the fact 
that X does not visit at a positive time). 

A standard comp ariso n the orem for stochastic differential equations can be used to com- 
pare the solutions of (15) and (23 1, and it follows that X is recurrent on (0, oo). 



We next observe that the finite measure p on (0, oo) defined by 

p(di) := <f'(£) 2 e £ / 2 d£ 
is invariant for X . Indeed, we have, for any bounded continuous function h on (0, oo), 

d£^'(£) 2 e^ 2 E e [h(X t )} 

d£i P '(£) 2 e i / 2 Q t e [h(X t )} 



(0,oo) 



(0,oo) 



d£ip'(£)E e \h(X t )ip'(X t )exp(^- / ds (1 - 2(p(X a )) 

(O.oo) L V Z JO 



d£ip'(£) 2 e e/2 h(£), 



(0,oo) 



where the last equality follows from ( 22 ) . We normalize p by setting 

P 



P = 



p((0,oo))- 



It is then easy to prove, for instance by a coupling argument, that the distribution of X t 
under Pi converges weakly to p as t — > oo, for any £ > 0. Consequently, for any bounded 
continuous function g on [0, oo), and every £ > 0, 



Ee[g(X t )} 



9 dp. 



(24) 



We claim that ( |24[ ) remains true if g is continuous, monotone increasing and nonnegative 
(but non necessarily bounded) and such that J g dp < oo. To see this, introduce a monotone 
increasing sequence of truncation functions hk, k= 1,2,... on [0, oo) such that < h% < 1, 
hk = 1 on [0, k] and hk has compact support. By the bounded case, 



E e [g(X t )h k (X t 



gh k dp 



and the limit is close to J g dp when k is large, by dominated convergence. On the other 
hand, for every £ > 0, 



\E t \g(X t )] ~ E,[g{X t )h k {X t )]\ < E t [g{X t ) 1 { ~ >fc} ] 



<{p{%oo))- 



[l,oo) 



p(dr)E r [g(X t )l 



{x t >k}i 



using the monotonicity of g and the fact that if r > £ we can couple the diffusion process 
with generator C started from r and the same diffusion process started from £ in such a way 
that the first one stays greater than or equal to the second one. Furthermore, 



p{dr) E r [g(X t ) l^ t>k} ] < / p{dr) E r [g(X t ) 1 |5 ? >fc> ] = / p(dr) g{r) l {r>k} 
is small when k is large by dominated convergence, and this proves our claim. 
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We can thus apply ( 24 1 to the function 



which satisfies the desired properties and in particular is such that J g dp — — J <p'(l) di = 1. 
Note that for this particular function g, 



E e [g(X t )} = Q\[g{X t )] = 



e -i/2 



E, 



It follows from (24 1 that, for every £ > 0, 

t 



e -i/2 

lim — - E, 



exp i 



{-J da(\-2tp(X.)j) 



exp^- / ds(l-2<^(X s )) 



.g dp: 



p((0,oo)) L ,dy(a) 2 eV2 



This gives the first assertion of the proposition. 

The second assertion is now easy. By the first assertion, there exists a constant K such 
that, for every r > 0, 



E 



exp 



(- / ds(l-2<p(X s ))) 



< if. 



Since the function 99 is monotone decreasing, a comparison argument gives for every £ > 
and r > 0, 



ft 



exp 



(- / dsO—2<p{X a ))) 



<E 



exp 



ds (1 - 2ip(X a )) 



< K. 



This completes the proof of the proposition. 
To simplify notation, we set 



C := / ds V \s) 2 e s ' 2 = / / j(d£h(d£') 



(£ 



2(£ + £'-!)' 



Corollary 10. For every c > 0, 



lim $ r (c) = $oo(c) 



D 



where 



*-® = kh {da) 2{c+s-lY 



Proof. By definition, we have 

c f° 



-rf/2 



E t 



exp 1 



(- / da(l-2<p(X a ))) 



From Proposition [8] and an application of the dominated convergence theorem, we get 



c f c 
lim * r (c) = - / 



!e- ce / 2 x 



V On / 



/o Co 

The limit is identified with ^^(c) by a straightforward calculation. 



□ 
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3.3 The invariant measure 

For the purposes of this section, it will be useful to introduce the set of all pairs consisting 
of a tree TeT and a distinguished geodesic ray v, which we can represent by an clement of 
{1,2} N . We formally set 

T* =Tx {1,2} N . 

We can define shifts (r,.) r >o on T* in the following way. For r — 0, r r is just the identity 
mapping of T*. Then let r > and (T, v) e T*. Write v = (i>i, V2, ■ ■ ■) and v„ = (vi, ■ ■ ■ ,v n ) 
for every n > 0. Also let x r . v be the unique element of T r such that x r>v -< v . Then, if 
k = min{n > : z Vn > r}, we set 



Tr(T, v) = \T[x r , v } , (Vfc+1, Ufc+2, ■■•))■ 



Informally, r r (T,v) is obtained by taking the subtree of T consisting of descendants of the 
vertex at height r on the distinguished geodesic ray, and keeping in this subtree the "same" 
geodesic ray. It is straightforward to verify that r r o t s = r r+s for every r, s > 0. 

Under the probability measure P (£> P, we can view (r, Woo) as a random variable with 
values in T*. Write 0* for the distribution of (r, Woo). Then 0* is not invariant under the 
shifts T r , but Corollary [10] will give an invariant measure absolutely continuous with respect 
to 6*. 

Proposition 11. The probability measure 

A*(dTdv) :=$oo(C(T))e*(drdv) 

is invariant under the shifts r r , r > 0. 

Proof. Let r > 0. We have 

r r (r,w 00 ) = (r«,w«), 

where r( r ) and WoV are as in the previous sections. 

By Proposition [7] we have, for any bounded measurable function FonT, 

E ® E[F(rM)] - f e(dT) $r(C(T)) F(T). 

Write vj- for the harmonic measure of a Yule-type tree T ■ At this point we use the flow 
property of harmonic measure. By Lemma [6] and the preceding identity, we have also, for 
any bounded measurable function F on T*, 



E <g) E[F(T^ , W^ )} = E (g) E [ / i/ rW (dv) F(r< r ) , v) 

= j @{dT)$ r {C{T)) j u T {dv)F{T,v) 

= J 9*(dTdv)$ r (C(T))F(T,v), (25) 

since Q*(dT dv) = Q(dT)vT{dv) by construction. 
If we now let r — > oo , Corollary [10] gives 

lim E®£[F(rW,W«)]= [Q*(dTdv)$ 00 (C(T))F(T,v) 

noting that the functions <i> r are uniformly bounded thanks to the last assertion of Propo- 
sition [8j Let s > 0. If we replace F by F o t s in the last convergence, observing that 
F o r s (rM, wio r) ) = F o r s o r r (r, Woo) = F{T^ s+r \W^ +r) ), we get 

ye*(drdv)$o C ,(c(r))F(r,v) = y'e*(drrfv)<i>o (c(r))FoT S (r,v), 

which was the desired result. □ 
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Proposition 12. For every r > 0, the shift r r acting on the probability space (T*,A*) is 
ergodic. 

Proof. We take r = 1 in this proof, and we write t = t± for simplicity. We essentially rely on 
ideas of [H] (see also [201 Chapter 16]). However our setting is different, because our trees 
are not discrete, and also because we consider ordered trees rather than unordered trees in 
[18] , For this reason, we will provide some details. We write m for the canonical projection 
from T* onto T, and let A be the image of A* under this projection, so that 

A(dr) = $oc(c(r))e(dr). 

We define a transition kernel p(T, dT') on T by setting 

p(T,dV) = J2 ^r({v G {1,2} N : x -< v}) 6 T[x] (dT). 

Informally, under the probability measure p(T, dT'), we choose one of the subtrees of T above 
level 1 with probability equal to its harmonic measure. Then it follows from Proposition 1 1 1 1 
that A is a stationary probability measure for the Markov chain with transition kernel p. 
Indeed, Lemma [6] shows that we may obtain this Markov chain under its stationary measure 
A by considering the process 

Z n {T, v) = 7n(7-„(7»), n = 0, 1,2, . . . 

on the probability space (T*,A*). Note that Z (T,v) = T. 

Write T°° for the set of all sequences (T°, T 1 , . ■ .) of elements of T. By [201 Proposition 
16.2], if a measurable subset F of T°° is shift-invariant for the Markov chain Z, in the sense 
that 1f(Zq, Z\, . . .) = \f{Z\, Z 2 , ■ ■ ■) a.s., then there exists a measurable subset A of T such 
that 

1 F (Z ,Z 1 ,...)^1 A (Z ), a.s. 

and moreover 

p(T,A) = l A (T), A(dT)&.s. 

We let T°° be the set of all sequences (T° , T 1 , . . .) in T°°, such that, for every integers < 
i < j, T 3 is a subtree of T above generation j — i (i.e., there exists a point x 6 7?_j such that 
T j =T i [x\). Note that f°° is a measurable subset of T°° and that (Z (T, v), Z X (T, v), . . .) € 
T°° for every (T, v) € T*. If (T^.T 1 , . • .) € f °°, there exists v € {1,2} N such that T j = 
Zj(T°,v) for every j > 0, and we set ^(T°,T l , ■ ■ ■) = (T°,v). Note that v is a priori 
not unique, but for the previous definition to make sense we take the smallest possible v in 
lexicographical ordering (of course for the random trees that we consider later this uniqueness 
problem does not arise). In this way, we define a measurable mapping "J from T°° into T*, 
and we have ^(Z (T, v), Zi(T, v), . . .) = (T,v), A* a.s. 

Let us now prove the statement of the proposition. We let B be a measurable subset of 
T* such that t^ 1 (B) = B, and we aim at proving that A*(B) = or 1. To this end, we set 
F = , $>~ 1 (B), which is a measurable subset of T 00 C T°°. Furthermore, we claim that F is 
shift-invariant. To see this, we have to verify that 

{(Z , Z x , ...)£F}= {(Z U Z 2 , . . .) e F}, a.s. 

or equivalently 

{V(Z ,Z 1 ,...)€B} = mZ 1 ,Z 2 ,...)€B}, a.s. 

But this is immediate since by construction ^f(Z\, Z2, ■ • ■) = to ty{Z§,Z\, . . .) a.s. and 
t^ 1 (B) = B by assumption. 

From preceding considerations, we then obtain that there exists a measurable subset A 
of T, such that (Zo, Z\, . . .) £ F if and only if Zq G A, a.s., and moreover p(T, A) = 1a(T), 
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K(dT) a.s. Since ^f(Zo(T, v),Zi(T,v), . . .) = (T,v), A* a.s., it also follows that we have 
(T, v) e B if and only if T G A, A* a.s. 

However, from the property p(T, -A) = 1^(7"), A(dT) a.s., one can verify that A(A) = 
or 1. First note that this property also implies that p(7~, A) — 1a(T), O(dT) a.s. Hence, 
Q(dT) a.s., the tree T belongs to A if and only if each of its subtrees above level 1 belong to 
A (it is clear that that the measure p(T, •) assigns a positive mass to each of these subtrees). 
Then, if p^ = P(#Fi = k), for every k > 1, the branching property of the Yule tree shows 
that 

oo 

6(A) = ^ Pfc e(A) fc 
fe=i 

which is only possible if 0(A) — or 1, or equivalently A(A) = or 1. Finally, we also get 
that A*(B) — or 1, which completes the proof. □ 

3.4 End of the proof 

Recall that vj- stands for the harmonic measure of a tree T G T. With this notation, we 
have v = i>y- For every r > 0, we then consider the nonnegative measurable function F r 
defined on T* by the formula 

F r (T,v) = iA r (B r (v,r)) l 

where Bf(v, r) denotes the set of all geodesic rays of T that coincide with the ray v over 
the interval [0, r]. We claim that, for every r, s > 0, we have 

F r+S = F r x F s o r r . 

Indeed, if we write T r (T,~v) — (T^^^), this is equivalent to saying that 

v t (Bt(v, s + r)) w 

and the latter equality is an immediate consequence of Lemma [6j 
If we set G r = — log F r > 0, we have for every r,s > 0, 



and the ergodic theorem (with Proposition 12 1 implies that 



1 Qs A^a.s. A *( Gi ^ 

Since A* has a strictly positive density with respect to 0* , the latter convergence also holds 
0* a.s. Recalling that 0* is the distribution of (r, Woo), this exactly gives the convergence 
(fTl), with (3 — A*(Gi). This completes the proof of Theorem [21 except that we have not 
checked that j3 < 1. We will do this in the next proposition, and then we will complete the 
proof of Proposition [3] by deriving the explicit formulas pi for /3 in terms of the law 7 of the 
conductance C(T). 

Proposition 13. We have f3 < 1. 

Proof. Here again, we strongly rely on ideas from [TH] (see also [20] Chapter 16]). We start 
with some notation. If T G T and x G 71, we set 

#) = , r ({ve{l,2f:x^v}). 

Clearly (^7-(a;))xeTi is a probability distribution on 7i- We also set, for every T G T, 

t/(T) = liminf e-' r #T r G [0,oo]. 

r—too 
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It is well known that the preceding liminf is a limit, O(cTT) a.s., and that the distribution 
of U(T) under O(dT) is exponential. It follows that, for 6-almost every T £ T, we can also 
define, for every x G 71, 

U T {x) = lim e- r #T r -i[x) = - U(T[x}), 

and, if we set 

U T (x) 

UT{X) = U(T) 

the collection (^^(x))^^^ is a probability distribution on 71- 
By a concavity argument, we have 

xeTi TK ' 

and the inequality is even strict if {y^-{x)) x ^j\ ^ (wj-(x)) x ^f 1 . It is easy to verify that the 
latter property holds with positive probability under 0. We leave the details to the reader. 
Next we have 

^ A, ^ = / l0g ( MBr(v,l)) ) At(cfrdV) 



</E^)^(^)a(^t), 



where the last line follows from (26 1 and the fact that (y^-{x)) x ^j 1 ^ (ut{x)) xeTi with 
positive probability under 0, hence also under A. Next, recalling the Markov chain (Z n ) 
introduced in the proof of Proposition [12J we have 

£ •*(*> ^ (uk) ) A(rfr) = / ^ ^ (x) log (^fS ) A(rfr) 

= 1 

because Zo and Zi have the same distribution under A*, and we also use the fact that 
log U(T) is integrable under Q{dT) hence under A(dT). Together with the preceding display, 
this completes the proof. □ 

Proof of Proposition [3} The first assertion of Proposition [3] follows from Propositionjs] 
To complete the proof of Proposition [3] we start by establishing the first half of formula pF, 

that is, 

2 JJJ ^drMdsMdt) ^^r log g 

P J/ 7 (dr) 7 (d S )^b T ' [{) 

We use the notation of the beginning of this section, and we first fix e > and define a 
function H e on T* by setting 



n it v \ - / ° if z » - e ' 

" el ' ' Vj " 1 - log MW G {1, 2} N : Vl -< v'}) if z < e, 

where we write T = (zu)uev as previously, and we recall the notation v„ from the beginning 



of Section |3li| Clearly, H s (T,v) < G e (T,v), and # £ (T,v) = G e (T,v) if z Vl > e. More 
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generally, H e o r r (7", v) = G £ o tv(7",v) if there is at most one index i > such that 
r < z vi < r + e. It follows from these remarks that, for every integer n > 1, 



n-l 



Gl > X] ^1/" ° **/„ (28) 



fe=0 



and, for every (T, v) G T*, 



n-l 



Gi(7» = lim Vi?i/„or fc/n (r,v). (29) 

71.— ^OO ^ ' ' ' 



n— ¥ao 

k=0 



Let us then investigate the behavior of Q*(H E ) when e — > 0. It will be convenient to write 
7(i) and 7(2) f° r the two "subtrees" of T obtained at the first branching point (formally T(i) 
corresponds to the collection (zi V — z )„ e v, for i = 1 or 2). We observe that, if i = 1 or i = 2, 
the exit ray of Brownian motion on T will belong to {(i, v 2 , «3, . . .) : (v 2 , «3, • ■ ■) £ {1, 2} N } 
with probability 

C(7(i)) 
C(7(i))+C(7( 2) )' 
Thanks to this observation, we can write 

A*(jy e ) = -ye(dn* oo (c(T))i { , 0<e} 

/ C(7^d) C(7(i)) C(7( 2) ) C(7( 2) ) 

X VC(^ 1) ) + C(7^ 2) ) 0g C(7^ 1) )+C(^ 2) ) + C(^ 1) )+C(7^ 2) ) 0g C(^ 1) ) + C(7^ 2) ) 

- -2/ 9(dT) ^(C(7)) l^> C(7 ;gX W ) "« ^j^)) ' 
by a symmetry argument. An easy calculation gives 

C{T (1) ) + C{T (2) ) 



C(T) = 



e-'* + (l-e->*)(C(T(i ) ) + C(T {a) )) 



Since, under ®(dl~), 7(i) and 7(2) are independent and distributed according to 0, and are 
also independent of z® , we get 

x r <fae-* r C(T) + C(r) >) 

' $oo Ve- + (i- e -)(c(r)+c(r0)^ 



n 



Since ^oo is bounded and continuous, and the function (T, T ) H> efTI+ecr'*) ^°^ CfTT+CTTH ^ s 
clearly integrable with respect to Q{dT)Q(dT'), we can let e — > in the preceding expression 
and get 

C(T) ,_ C(71 



lim^A*(/f £ ) = -2 J J Q(dT)e(dT') 0>oo(C(r) + C(T0) ^|jj^pj log 



C(T)+C(V) b C{T)+C{T'Y 

(30) 



Since the limit in the preceding display is finite, we can use (29 1 and Fatou's lemma to get 



that A*(Gi) < oo, and then (28) (to justify dominated convergence) and (29 1 again to obtain 
that 

A*(Gx)= lim nA*(H 1/n ) 



formula (27 1 



coincides with the right-hand side of ( 30 1 . Finally, we use the expression of $00 to obtain 
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We will now establish the second half of formula Efy, which will complete the proof of 
Proposition [3] We let C ,Ci,C 2 be independent and distributed according to 7 under the 
probability measure P. Then, the denominator of the right-hand side of (27 1 can be written 
as 

E [ CoCl 
[C0+C1-I 

On the other hand, the numerator is equal to 



2E 



CqCi 



Co 



C (d 



Co-1 



log( 



C 2 )log(d 



C1+C2 
Ci 
d) 



C + d + d - 1 
= E[/(C 1 +d)]-E[< ? (C 1 - 
where we have set, for every x > 1, 

C a; 



E 



(C + C 2 )Ci log(Ci 



C + Ci + C 2 - 1 



C 2 )] 



f(x) = E 



Co 



1 



log a; 



and g(x) = E 



log Co 



Using (13), we can replace E[/(Ci 
g, to obtain 

E[/(C 1 + C 2 )]-E[ 5 (C 1 +C 2 )] 
C0C1 



•C 2 )]byE[/(C!)] 



Cpx 

Co + x - 1 
E[Ci(Ci - l)/'(Ci)], and similarly for 



= E 



Co 



E 



-E 



Ci-1 
C0C1 
C0 + C1-I 
Cpd (C - 1) 
Co + Ci - 
CqCi(Co 



logCi 

log C 



E 



-E 



C (Cq-1)Ci(Ci-1) 
(Co + d-1) 2 
C (Cq-1)Ci(Ci-1) 
(Co + d-1) 2 



logd 
log C 



■E 



C Ci(d - 1) 



Co + Ci - 1 



1 



d-l)-C d 



E[C ] 



Co 
-E 



■d-1 

CqCi 
Co + d-1 



If we substitute this in (27), we arrive at 

2/3 = 



E[C ] 



E 



C0C1 



_C +Ci-1_ 

which gives the second half of ^ and completes the proof of Proposition [31 
Remark. Despite all that is known about the distribution 7 (see Section |2.3[ ) , it requires 
some work to derive the fact that /3 < 1 (Proposition 13 1 from the explicit formulas of 
Proposition [3j The approximate numerical value (3 — 0.78... is obtained by first estimating 
7 using Proposition p^ (or more precisely the convergence of <I> fc (cr) to 7, for any probability 
measure a on [1, 00)), and then applying a Monte-Carlo method to evaluate the integrals in 
the right-hand side of pi). 



4 Gait on— Watson trees 

In this section we prove Theorem [T] We first explain why discrete reduced trees converge 
modulo a suitable rescaling towards the continuous reduced tree A. This leads to a first 
connection between the discrete harmonic measures and the continuous one (Proposition [17 1 . 
Combining this result with Theorem [2] one gets a first estimate in the direction of Theorem 1 
(Corollary 18 1. The recursive properties of Galton- Watson trees are then used to complete 



the proof of Theorem [T] 
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4.1 Notation for trees 

We use the standard formalism for trees. A (discrete) tree T is a finite subset of 



U = \j N'\ 



ra=0 

where N° = {0}, such that the following holds: 

(i) e t. 

(ii) If u = (ui, . . . , u n ) 6 t\{0} then u := [u\, . . . , u n -x) € r. 

(iii) For every u = (uj, . . . , u n ) € r, there exists an integer k u {j) > such that, for every 
j £ N, (m, ...,u n ,j) £t if and only if 1 < j < fc„(r). 

We view a tree r as a graph whose vertices are the elements of r and whose edges are 
the pairs {u, u} for all u £ t\{0}. 

We will use the notation and terminology introduced at the beginning of Section |2.1| in 
a slightly different setting. In particular, \u\ is the generation of u, uv denotes the concate- 
nation of u and v, -< stands for the genealogical order and u A v is the maximal element of 
{w e U : w -< u and w -< v}. 

Let T be a tree. The height of T is 

h(T) = max{|w| : v e T}. 
The set T is equipped with the distance 

d(v,w) = -(H + \w\ - 2\v A w\). 

Notice that this is half the usual graph distance. We will write BT(v,r), or simply B(v,r) 
if there is no ambiguity, for the closed ball of radius r centered at v, with respect to the 
distance d, in the tree T. 

The set of all vertices of T at generation n is denoted by 

T„ ~{v eT : \v\ = n}. 

If v € T, the subtree of descendants of v is 

T[v] := {V S T : v -< v'}. 

Note that T[v] is not a tree with our definitions, but we turn it into a tree by relabelling its 
vertices, setting 

T[v] :={)»eW:»i«e T}. 

If v S T, then for every i G {0, 1, . . . , |«|} we write (v)i for the ancestor of v at generation 
i. Suppose that \v\ — n. Then Bj(v 7 i) flT„ = T[(w)„_i] n T n for every i £ {0, 1, . . . ,n}. 
This simple observation will be used several times below. 

Gait on— Watson trees. Let 8 be a probability measure on Z + , and assume that 9 has 
mean one and finite variance a 2 > 0. For every integer n > 0, we let T 1 ™) be a Galton- 
Watson tree with offspring distribution 9, conditioned on non-extinction at generation n, 
viewed as a random subset of U (see e.g. [IS]). In particular, T^ ) is just a Galton- Watson 
tree with offspring distribution 9. We suppose that the random trees T^") are defined under 
the probability measure P. 

We let T* n be the reduced tree associated with T^ n \ which consists of all vertices of T^™) 
that have (at least) one descendant at generation n. A priori T*™ is not a tree in the sense 
of the preceding definition. However we can relabel the vertices of T* n , preserving both the 
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lexicographical order and the genealogical order, so that T* n becomes a tree in the sense of 
our definitions. We will always assume that this relabelling has been done. 

Note that |u| < n for every u £ T*". It will be convenient to introduce truncations of 
T* n . For every s £ [0, n], we set 

i? s (-p") = { V eT" l :M<n-LsJ}- 

We then consider simple random walk on T*"(w), starting from the root 0, which we 
denote by Z n = (Z^)k>o. This random walk is defined under the probability measure P (as 
previously it is important to distinguish the probability measures governing the trees on one 
hand, the random walks on the other hand). 

We let 

H„ =inf{/c>0: \Z 7 k l \ = n} 

be the first hitting time of generation n by Z n , and we set 

S n = z Hn . 

The discrete harmonic measure /i„, is the law of £„ under P. Notice that fj, n is a probability 
measure on the set T* n of all vertices of T* n at generation n. 

We start with a lemma that gives bounds on the sizes of level sets in T* n . 

Lemma 14. There exists a constant C depending only on 9 such that, for every integer 
n > 2 and every integer p such that 1 < p < n/2, 

E[(log#T™ p ) 4 ] 1/4 <C'log n and E[Qog#T?W < C logn. 

Proof. Set q n — P(/i(T^ ') > n). By a standard result (Theorem 9.1 of [21 Chapter 1]), we 

have 

2 

q n — — -, as n -> co. (31) 

Then, for every p £ {0, 1, . . . , n}, 

E[#T™ p ] = £[#{*; £ l { :] p : h(T^[v}) > p}} = (q^ 1 E[#{t- £ T^ p : h(T^[v]) > p}}. 

By the branching property of Galton-Watson trees, the conditional distribution of #{ij £ 
T„2„ : h(T^[v]) > p} knowing that #T„2 = k is the binomial distribution B(k, q p ). Hence, 

E[#Tr- P ] = 9pE[#T "- p] = ^. 

We can find a > such that the function x — » (l°g( a + ^)) 4 is concave over [1, oo). Then, 
E[(log#T™ p ) 4 ] 1/4 < E[(log(a + #T™ p )) 4 ] 1 /4 < l og(a + E[#T«l p ]) = log(o + J), 

Qn 



and the bounds of the lemma easily follow from (31 ). □ 



4.2 Discrete and continuous reduced trees 
4.2.1 Convergence of discrete reduced trees 



Recall from Section 2.1 the definition of the continuous reduced tree A. For every e £ (0, 1), 
we have set A £ = {x £ A : H(x) < 1 — e}. We will implicitly use the fact that, for every 
fixed e, there is a.s. no branching point of A at height 1 — e. The skeleton of A e is defined 
as 

Sk(A £ ) := {0} U {v £ V\{0} :Y(,<l-e} = {0}U{v£ V\{0} : (v, Y,) £ AJ. 
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Consider then a (discrete) tree T such that every vertex of T has either 0, 1 or 2 children. 
It will be convenient to write Tbi n for the collection of all such trees. With T we associate 
another tree denoted by [T] , which is obtained by "removing" all vertices that have only one 
child. More precisely write <S(T) for the set of all vertices v of T having or 2 children. 
Then we can find a unique tree [T] such that there exists a bijection u — > w u from [T] onto 
<S(T) that preserves the genealogical order and the lexicographical order of vertices. We call 
this bijection the canonical bijection from [T] onto <S(T). 



1 . ■ I I ',■ t" t £ i i x -^ iii v '■ T 



[ne\ 
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On the left, the tree A, its truncation A F and 



Figure 6: Setting of Proposition 

its skeleton Sk(A e ). On the right, a large reduced tree T of height n, its truncation 

R en (T) and its associated binary tree [i? en (T)]. 



Proposition 15. We can construct the reduced trees T* n and the tree A on the same prob- 
ability space (rijJ 7 , P) so that the following properties hold for every fixed e € (0,1) with 
F-probability one. 

(i) For every sufficiently large integer n, we have R en (T* n ) G Tbin and [R en (T* n )] = 
Sk(A e ). 

(ii) For every sufficiently large n, such that the properties stated in (i) hold, and for every 
u G Sk(A e ), let iu™' e denote the vertex of S(R £n (T* n )) corresponding to u via the 
canonical bijection from [i? en (T*")] onto S(R en (T* n )). Then we have 

lim -\w2' s \=Y u A(l-e). 

n— »oo Tl 

Proposition [15] is essentially a consequence of classical results on the convergence in 
distribution of reduced critical Galton- Watson trees, see in particular [25] and [5]. A simple 
way of proving Proposition [15] is to use the convergence in distribution of the rescaled height 
functions associated with the trees ~P™' towards a Brownian excursion with height greater 
than 1 (see [TSJ Corollary 1.13]). By using the Skorokhod representation theorem, one 
may assume that the trees T*-™) and the Brownian excursion are constructed so that the 
latter convergence holds almost surely. The various assertions of Proposition [15] then easily 
follow, using the relation between the Brownian excursion with height greater than 1 and 
the continuous reduced tree A, which can be found in rjSJ Section 5]. 

Let us comment of the properties stated in Proposition [15] We first note that we have 
Y u > 1 — e if and only if u is a leaf (i.e. a vertex with no child) of Sk(A £ ). The vertex w™' £ , 
which is well defined for u G Sk(A e ) when n is large, does not depend on e, provided that 
u is not a leaf of Sk(A £ ). More precisely, suppose that < S < e, and suppose that n is 
sufficiently large so that the properties stated in (i) hold as well as the same properties with 



e replaced by S. Then if u 6 Sk(A £ ) is not a leaf of Sk(A e ), we must have u>™'' 



w 



n,8 



On 
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the other hand, if u is a leaf of Sk(A e ), then we must have \w™ ,£ \ = n — \en\ , and w" ,e is an 
ancestor of w™' 5 . We leave the easy verification of these properties to the reader. 

4.2.2 Convergence of conductances 

Let T C U be a tree such that h(T) > i, and consider the new graph T' obtained by adding 

to the graph T an edge between the root and an extra vertex d. We let C^(T) be the 

probability that simple random walk on T' starting from hits generation i of T before 

hitting the vertex d. The notation is justified by the fact that Ci(T) can be interpreted as 

the effective conductance between d and generation i of T in the graph T', see [201 Chapter 

2]- 

Proposition 16. Suppose that the reduced trees T* n and the tree A are constructed so that 



the properties stated in Proposition 15 hold, and that the Yule tree T is obtained from A as 
explained in Section\2l^ Then 



nC n {T* n ) -ii-» C(T) . 

n— >oo 

We omit the easy proof, as this result is not needed for the proof of Theorem [l] 

4.2.3 Convergence of harmonic measures 

Our goal is now to verify that the discrete harmonic measures \x n converge in some sense to 
the continuous harmonic measure /i defined in Section |2.1| 
For every x £ dA e = {z £ A : H(z) = 1 — e}, we set 

fj, E (x) = lJ.{{y£dA:x-<y}). 

Similarly, we define a probability measure jj,^ on T* 7 ^, , by setting 

Mn( u ) = Vn{{v G T„ : u -< v}), 
for every u £ T*™ , , . Clearly, [f n is also the distribution of (£n)n-Len| • 
Proposition 17. Suppose that the reduced trees T* n and the tree A have been constructed so 



that the properties of Proposition 15 hold, and recall the notation ( - w" ,e )„ e sk(A e ) introduced 



in this proposition. Then P a.s. for every x = (v, 1 — e) £ dA E 

lim n 6 n (w^ 6 )=^ e (x). 

n— J-oo 

Proof Let 8 £ (0,e) and set T s = inf{i > : H(B t ) = 1 - 8} < T. Define a probability 
measure /j, 5 '^ on dA E by setting for every x £ dA E , 

^ s \x) = P{x<B Ts ). 

Similarly, we write Hn for the distribution of the hitting point of generation n — \_8n\ by 
random walk on T*™ started from 0, and we define a probability measure /4j on T*™ , , 
by setting 

^' ) (f)=/«i i) ({«'6T« N :Mi»}) ) 

for every veT* n n _ len} . 
It is easy to verify that 

limn e '^ ) (x) = n e (x) 

for every x £ dA e , P-a.s. Indeed we have the bound \fi e ^(x) — fJ- e (x)\ < 8/e, which follows 
from the fact that there is probability at least 1 — 8/e that after time Ts Brownian motion 
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will hit the boundary dA before returning to height 1 — e (and if this event occurs then for 
x £ dA e , we have x -< B? if and only if x -< Bt 5 )- By similar arguments, one has P-a.s. 



lim limsup sup 

<5->0 \ n ->co \ v£T 



\^ iS) (v)-nUv)\ 



= 0. 



L=nJ 



In view of the preceding remarks, the convergence of the proposition will follow if we can 
verify that for every fixed 6 £ (0, e), we have a.s. for every x = (u, 1 — s) £ <9A £ , 



lim /#<*>«•*) = M e ' W (*). 



(32) 



By considering the successive passage times of Brownian motion stopped at time T$ in 
the set {(v, Y v A (1 — 6)) : v £ Sk(A^)}, we get a Markov chain X ^ 5 \ which is absorbed in the 
set {(v, 1 — 6) : v is a leaf of Sk(A^)}, and whose transition kernel are explicitly described in 
terms of the quantities Y v , v £ Sk(Aa). 



Let n be sufficiently large so that assertions (i) and (ii) of Proposition 15 hold with e 



replaced by 6, and consider random walk on T*" started from and stopped at the first 
hitting time of generation n — \6n\ . By considering the successive passage times of this 
random walk in the set {w™' S : v 6 Sk(A,j)}, we again get a Markov chain X^' n , which is 
absorbed in the set {w™' S : v is a leaf of Sk(A^)} and whose transition kernels are explicit in 
terms of the quantities |u>™|, v £ Sk(A^). 

Identifying both sets {(v,Y v A (1 - 5)) : v £ Sk(A«5)} and {w^ s : v £ Sk(A«5)} with 
Sk(A,5), we can view X^ and X^ s '' n as Markov chains with values in the set Sk(A^), and 



then assertion (iii) of Proposition 15 implies that the transition kernels of X^' n converge 



to those of X^ s \ Write X^' for the absorption point of X^ s \ and similarly write X ao ' ,n for 
the absorption point of X^' n . We thus obtain that the distribution of X^ converges to 
Consequently, for every u £ V such that x = (u, 1 — s) £ dA £ , we have 

lim P(u^X£>' n ) = P(u^X£>). 



that of x£ ] 



However, from our definitions, we have 

and, for n sufficiently large, noting that w™ ,£ coincides with the ancestor of w™' 5 at generation 
n — \en\ (see the remarks after Proposition 151, 



p(u^x^ n ) = ^ iS) «n- 



This completes the proof of ( 32 1 and of the proposition 



□ 



Recall that (x)i is the ancestor of x at generation i < \x\ 



Corollary 18. Let £ £ (0, 1). We can find £o £ (0, 1/2) such that the following holds. For 
every e £ (0,£o); there exists no > such that for every n > uq we have 



E®£ 



lo g^((£„)™-Le»j) -£log£ 



< ^|loge| : 



Proof. Recall our notation SS^x^r) for the closed ball of radius r centered at a; £ A. Fix 
r] £ (0,1). Since Bt is distributed according to /i, it follows from Theorem [2] that there 
exists £q £ (0, 1/2) such that for every e £ (0,£q) we have 



P[ \o gf i(^ d (B T ,2e))- p log e > fa/2) | log e\ < »j/2 



(33) 
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Let us fix £ £ (0, £o). We now claim that, under P ® P, 

/4«£„)„_ LctiJ ) ^> M (^d(ST,2e)) 



(34) 



To see this, let / be a continuous function on [0,1]. Since the distribution of (S„) n _^ e „j 
under P is [i e n , we have 



E<g>£ 



/(/4«£ n ) n _ LEnJ ))] =e[ Y ^W/«W) 



b.€T" 



Le™j 



By Proposition fl5J we know that P a.s. for n sufficiently large, 



Y H e n {u)f{n e n {u))= Y. ^«)/(/4«)) 



uGT 



n -Le~J 



leaA, 



and by Proposition [17] the latter quantities converge as n — > oo towards 
J] M c »/(M e 0£)) = B[/(^(^d(Br,2e)))]. 



xedA, 



Our claim (34) now follows 



By (33 1 and (34 1, we can find n Q — n (e) > s such that for n > no we have 
P®P(|lQg^«E„) n _ Len j)-0]Qge| >r?|loge|) < n. 

It follows that 



E®E [| log /£((£„)„_ LenJ ) -0 logs 

<r ? 2 |log£| 2 + 7 ? 1/2 E® J B[|logMn((E„)n-Lenj)-/31oge 

<(r ? 2 + 2 ?7 1 / 2 /3 2 )|loge| 2 + 2, 7 1 / 2 E® J B[|log^((S„) n - L e„j 



1/2 



1/2 



(35) 



Let us bound the last term in the right-hand side. It is elementary to verify that the function 
g(r) = (r A e~ 4 ) | log(r A e~ 4 )| 4 is nondecreasing and concave over [0, 1]. It follows that 



E 



l lo g/4«s„)„-L e „j)| 4 = Y /4( u )l lo s/4(w)l 4 

• Y ^HAe- 4 )|log(^( W )A e - 4 )| 4 +4 4 
Y <?(/4M)+4 4 



u£T* 



u£T* 



-[e»J 



<#T;;i L£ „jX. g ((#T;" 



[en] . 



-n ,_ 4 4 



< 



log#T 



n— \£n\ 



2 x 4 4 



We now use Lemma 14 to get 



E®E 



|l0g/4(( S n)n-LenL 



< 2 x 4 4 + E 



iog#T;i 



LenJ 



< 2 x 4 4 + C 4 ( log 



LenJ 



By combining the last estimate with (35), we get that, for every n > no(e) 



E®E 



log^«S„)„- L e„j)-/31oge| 2 l < (n 2 + 2 V 1 / 2 p 2 )\loge\ 2 + 2 v 1 / 2 (2 9 / 2 + C 2 \loge\ 2 ). 



The statement of the corollary follows since r\ was arbitrary. 



□ 
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4.3 Proof of Theorem [T] 

We need a few preliminary lemmas before we can proceed to the proof of Theorem [Tl 

4.3.1 Preliminary lemmas 

Our first lemma is a discrete version of Lemma [6] This result is well known and corresponds 
to the "flow rule" for harmonic measure in |18j . We provide a detailed statement and a brief 
proof because this result plays a key role in what follows. 

We consider a discrete tree T with height n, and we write Z^ > = (Z k )fc>o for simple 
random walk on T starting from (we may assume that this process is defined under the 
probability measure P). We set 

H^=ini{k>0:\Z { J ) \ = n}, 

and si T) = Z {T) m . We let ^ T) be the distribution of ni T) 



fi T) ' 
For < p < n, we set 

4 T) =sup{/c <HP:\ZJp\= P }. 

V 

( T )\ 



Clearly Y> n ' G T[Z%], and therefore Z K ^ = (E 



Lemma 19. Let p G {0, 1, ... , n — 1} and z G T p . Then, conditionally on (£„ ) p — z, the 
process 

\ (4 T) +fe)A^ T) /fc>o 

is distributed as simple random walk on T[z] starting from z and conditioned to hit T[z] (~lT n 
before returning to z, and stopped at this hitting time. Consequently, for every integer q G 
{0, 1, . . . , n — p}, the conditional distribution of 

M l T) (J3 T (sl T) , g )) 
^\Bj{^\n-p)) 

knowing that (S„ ) p = z is equal to the distribution of 

^n-p l-°T[z] \Pn-p ■•'ill- 
Proof. The first assertion is easy from the fact that the successive (non-trivial) excursions of 
Z^> in the subtree T[z] are independent (and independent of the behavior of Z^ T ' outside 
T[z]) and have the same distribution as the excursion of random walk in T[z] away from z. 
We leave the details to the reader. 

Let us explain why the second assertion of the lemma follows from the first one. Clearly 
the distribution of the hitting point of T[z] (~lT n by simple random walk on T[z] starting from 
z and conditioned to hit T[z] nT„ before returning to z is the same as the distribution of the 
hitting point of T[z] (~l T„ by simple random walk on T[z] starting from z. Let fi n be the 
conditional distribution of £„ knowing that (S^) p = z. We get from the first assertion 
of the lemma that /i„ is equal to the hitting distribution of T[z] n T„ for simple random 
walk on T[z] started from z (note that we are here interested in the subgraph T[z] of T and 
not in the "relabelled" tree T[z]). It also follows that, for every integer q G {0, 1, . . . , n — p}, 
the conditional distribution of 
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knowing that (£' ')p = z coincides with the distribution of 

Vn-p K ±i T[is]\, lj n-p 'Q))- 

Now notice that, on the event (£( T )) p — z, fj, n '' z (Bj(T, ( - T \q)) is equal to 

This gives the second assertion of the lemma. □ 

If 1 < i < n, T* n [(T, n ) n _i] is the subtree of T* above generation n — i that is "selected" 
by harmonic measure, and T* n [(S ra )„_ i ] is the tree obtained by relabelling the vertices of 
T*"[(E„)„_i] as explained above. It is not true that the distribution of T* ra [(S n )„_i] under 
P (g) P coincides with the distribution of T" under P - this would be the case if instead of 
considering T*™[(S„)„_ i ] we looked at T* n [z] with z chosen uniformly at random over T*1j, 
but harmonic measure induces a distributional bias. Still the next lemma gives a useful 
bound for the distribution of T*™[(S„)„_j] in terms of that of T*\ We recall the notation 



Ci(T) from Proposition 16 



Lemma 20. For every i e {1, . . . , n — 1} and every nonnegative function F on the space of 
trees, 

E®E[F (T*"[(S„)n-,])] < (* + 1) E[C,(T«) F(T* 1 )]. 

Proof. Fix i G {1, ...,n — 1} in this proof. Recall our notation Ri(T* n ) for the tree T* n 
truncated at level n — i. From the branching property of Galton- Watson trees, one easily 
verifies the following fact: Under P, conditionally on i? i (T* n ), the (relabelled) subtrees 
T* n [w], v £ T*™ i are independent and distributed as T" (to make this statement precise we 
can order the subtrees according to the lexicographical order on T*™ 4 ). 

Consider the stopping times of the random walk Z n which are defined inductively as 
follows, 

t/ n = infjfc > : \Z£\=n-i}, 

V n = mf{k>U?:\Z%\=n- t-1}, 

and, for every j ' > 0, 

U? +1 =vai{k>V?:\ZZ\=n-i} ) 
V? +1 = M{k>U? +1 :\Z%\ = n-i-l}. 

Set WJ 1 = Zy n for every j > 0. Then, under the probability measure P, (M / T)j>o is a Markov 
chain on T*™ i , whose initial distribution and transition kernel only depend on Ri(T* n ). 
Now observe that 

(£„>„_* = Wl 

where jo is the first index j such that 

sup \Zl\ = n. (36) 

U n <k<V n 



If j > is fixed, then conditionally on the Markov chain W n the probability that (36 1 holds 
is Ci(T* n [Wp}). 

Thanks to these observations, we have 

oo j — 1 

E [F (r"[(E„)„_,])] = Y,E[F(T m [W?]) Ci(T* n [W?]) JJ (l - ^(T*"^"]) 

j=0 1=0 
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We then use the simple bound Cj(T) > ^K, which holds for any tree T with height greater 
than or equal to i. It follows that 

1 \J 



E[FCr n [<E„>„_i])] < £ (l - TTl) E [ F (T* n [W?])d(T* n [W?]) 



3=0 



For every u £lA with |u| = n—i, let tt"(u) = P(W™ = u), and recall that 7t?(m) only depends 
on the truncated tree Ri(T* n ). Then, for every j > 0, 



E®E 



F(T* n [W?])C t (T* n [W?}) 



= ::. E ^»^(t*>])c 4 (t*>]) 

ugT™. 

= E[F(T* i )C i (T* i )], 
by the observation of the beginning of the proof. We conclude that 

1 V 



25[FO-* n [<E„>„_,])]<X;(i 

J'=0 



i + 1 



E^CT-JCiO"**)] - (i + l)E[F(T**)Ci(T**)], 



as desired. 

Our last lemma gives an estimate for the conductance Cj(T* J ). 
Lemma 21. There exists a constant K > 1 smc/i tftat, /or every integer n > 1, 

E[C„(T*") 2 ] < ^ 



D 



(n+1) 2 

Proof. Obviously we can assume that n > 2, and we set j = [n/2\ > 1. An immediate 
application of the Nash- Williams inequality J2U[ Chapter 2] gives 



c„(t*") < 



#T« 



(just consider the cutsets obtained by looking for every integer i £ {1, ... ,j} at the collection 
of edges of T*™ between generation £ — 1 and generation £). Then, 

E[(#T*") 2 ] = E[(#{v £ Tf : MT (0) M) > « - j}) 2 I MT (0) ) > n] 
= tfnWiv G Tf : MT (0) H) >n-i}) 2 ]. 
As we already observed in the proof of Lemma Pul the conditional distribution of #{v £ 



follows that 



-(0) 

3 



T a : ft.(T(°)[v]) > n — j} knowing that #T^ — k is the binomial distribution B(k,q n -j)- It 



E[(#{v £ Tf : MT (0) H) > n - j}) 2 ] = ^- i E[(#Tf ) 2 ] + («„_, - ^_.) E [#tJ° } ] 



2 2 ■ , / 2 \ 



We conclude that 



E 



[C„(T™) 2 ] < (i 2 ^)- 1 (^CX 2 ? + <?„_,), 



and the statement of the lemma follows from (31 1 



□ 
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4.3.2 Proof of Theorem [Q 

We will prove that 

E <g> E [|log //„(£„) + /31ogn|] = o(logn), as n — » oo. 

Theorem fl] follows, since ([37]) and the Markov inequality give, for any S > 0, 



(37) 



P(|log^„(E n ) + /31ogn| > Slogn) 



+ 0, 



and therefore 



E 



P(/x n (S„) < n 



-p-6 



or 



Mn(S„) > n-^+ 5 ) 



->0. 



Since by definition /i n is the distribution of S„ under P, the last convergence is equivalent 
to the first assertion of Theorem Q] 

Fix £ > and let e > and no > be such that the conclusion of Corollary [18] holds for 
every n> n^. Without loss of generality, we may and will assume that e = 1/N, for some 
integer N > 4, which is fixed throughout the proof. We also fix a constant a > 0, such that 
a log N < 1/2. 

Let n > N be sufficiently large so that JV"L alogn J > no _ We then let i? > 1 be the unique 
integer such that 

N l <n<N e+1 . 



Notice that 



logn 

logiV 



1 <£< 



log n 
log AT' 



Our starting point is the equality 

/i„(E„) , A 1 /i„(B(S„,iV^ 1 )) 



log^„(E„) =log 



/x n (B(S„,iV)) + ^ log ' M „(B(S„,iVJ)) 



(38) 



+ togA* n (B(E n ,i\T / )). (39) 



To simplify notation, we set 



A'l 



log 



A i n(^ J n) 



A" = 10. 



An 



H n (B(H n ,N)) 

M „(B(£„,7V-i)) 
log //„(£(£„, i\T')) + /31og(n/iV< 



/31ogiV, 

j3 log AT for every j £ {2, . . . , £}, 



From ( 39 1 , we see that 

E® £ [|log /*„(£„) + /31ogn|] = E®E 



e+i 

3=1 



<£E®£[L4?|] 



We will now bound the different terms in the sum of the right-hand side. 



First step: A priori bounds. We first verify that, for every j £ {1,2, 



have 



E (g> E[A%] < (CVK + /3) log N, 



(40) 



1}, we 
(41) 



where C is the constant in Lemma |14[ and K is the cons tant in Lemma |21| Suppose first 
that 2 < j < I. Using the second assertion of Lemma 19 with p = n — N 3 and q = N 3-1 , 
we obtain that, for every z £ T^ NJ , the conditional distribution of A T - under P, knowing 
that (£ n )„_j\rj = 2 i i s the same as the distribution of 

log$P lz]) (B(X%r iz]) ,W- l ))+piogN, 
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Recalling that fi NJ is the distribution of SL^ under P, we get 



E[\A*\ 



,(T*"[z]) 



log^. ™(*(E& ™, iV^ 1 )) 



(T*"N) 



■/31ogiV 



= -G,-(T*"[z])+/?logJV, (42) 

where for any tree T with height N 3 ' , 

Gj(J) = J ^)(dy)\log^](B T (y,N^ 1 ))\ = £ tf$(f Ni -i[z])\logp$(f N ^[z]) 



NJ -N3- 1 



In the latter form, Gj(T) is just the entropy of the probability measure that assigns mass 
[V N J (Tjyi-i[z]) to every point z € T^-am-i. By a standard bound for the entropy of 
probability measures on finite sets, we have Gj(T) < log#Tjy 3 '_ W 3-i for any tree T with 



height N 3 . Recalling ( 42 1 , we get 



E®E[\A?\]<E®E\\0 g #T*$_ Ni _ 1 [(V n ) n _ Ni ]]+01 OS N 



< (AP + 1)E C^O*^) log#T^ 3 _ w -_ ] 



+ /31ogiV 



*A«n\2' 



<(N 3 + 1)E (C N >(T* N )) 



(log#T* I ^ l _ NJ - 1 



1/2 



< VKE 



1/2 



E[(log#T^ JV3 _ 1 )' 
■/31ogiV, 



1/2 



•/3 log TV 



using successively Lemma |20| the Cauchy-Schwarz inequality and Lemma |21| Finally, 
Lemma [14] gives 



E 



.log#T#Ljvi-i) 



1/2 



< Clog TV, 



and this completes the proof of ( |41| when 2 < j < t. 

The cases j = 1 and j = £+ 1 are treated on a similar manner. For j = £+ 1, we observe 
that the same entropy bound gives 

E[\]og^(B(Z n ,N t ))\] = Yl Mn(f*"[y]) |log M „(T*™[2/])| < log#T™^. 

y6 T r „/ 



It follows that 



E®E[\logHn(B(Z n ,N e ))\] <E[log#T™ wf ] < ClogiV, 



by Lemma 14 and using the fact that N l < n < N e+1 . 

Finally, for the case j = 1, we use exactly the same argument as in the case 2 < j < £, 
to get 



E 



log 



<£ [log #!-;"[(£„ 



-AT 



/i n (B(E n ,J\0) 

and we obtain similarly, using Lemma |20[ Lemma |2l] and Lemma |14| 

E ® £ [log#T; n [(S n )„_^]] < (TV + 1) E [CjvCT*^) log #7^ 

<^E[(log#T*/) 2 ] 1/2 
< CVKlogN. 



This completes the proof of ( 41 1 



Second step: Refined bounds. We will get a better bound than (41 1 for certain 
values of j. Precisely we prove that, if [adognj < j < £, we have 



E<g>E[|A7|] < x/CKlogN. 



(43) 
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Let us fix j € { [a log nj ,...,£} . Recall that we have then N 3 ' > tiq . From ( 42 ) , we have 

E[\AJ\}=E[F j (T* n [('Z n ) n _ NJ })} ) (44) 

where, if T is a tree with height N 3 , 

F j (T) = \j3logN-G j (T)\ = J i${dy) {\og^]{B T {y,Ni- l )) + p\ogN) 

Using Lemma 20 as in the first step, we have 

E®E[\A]\]=E®E{F j (T* n [(X n ) n _ Ni ])] < (N 3 + 1)E [c Ni (J* Ni )F j (J* Ni j . 
We then apply the Cauchy-Schwarz inequality together with the bound of Lemma |2T] to get 



<VK -E 



H Ni {dy) \\og^ ND {B{y,N^ l )) + p\ogN\ 



1/2 



lt Ni {dy) |log M Ar J (S(y,iV^ 1 )) + /31ogiV 

1/2 



-,1/2 



< VK-E 

= VK-E®E\\logpi N] (B(Y, N] ,N : >- 1 )) + piogN\ 

= VK-E®E |log M ] v / f((S JVJ ) JV3 _ JVJ - 1 )+/31ogiV 

where the last equality follows from the definition of the measures \x e n at the beginning of 
Now recall that 1/N = e and note that N 3 - A"" 1 = N j - eN 1 . Since we 



1/2 



4.2.3 



Section 

have N 3 > uq , we can apply the bound of Corollary [18] and we get that the right-hand side 



of the preceding display is bounded above by \/£,K log N, which completes the proof of (43 1 



By combining (41 1 and (43 1, and using (ktOk, we arrive at the bound 



E®.E[|log^„(S„)+^logn|] < ^ log n\ (CVK + 0) log N + I y/^K log N 

< (a{CVK + p) log N + y/(K\ log n, 

which holds for every sufficiently large n. Now note that £ > can be chosen arbitrarily 
small. The choice of £ determines the choice of N, but then we can also choose a arbitrarily 



small given this choice. We thus see that our claim (37) follows from the last bound, and 
this completes the proof of Theorem [T] 

5 Complements 

5.1 A different approach to the continuous results 

In this section, we briefly outline another approach to Theorem [2] which is based on a 
different shift transformation on the space T*. Informally, if (T, v) G T* , we let S(T,v) be 
obtained by shifting (T, v) at the first node of T. More precisely, if T corresponds to the 
collection (z„).„ e v, and v = (vi,t>2, ■ • •)) we se t 

S(T,v) = (T M ,v) 

where v = (t>2, t>3, ■ ■ •) and, for i = 1 or i — 2, Tu) is the tree corresponding to the collection 



[z,i V — z ) ve v, in agreement with the notation of Section 3.4 
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Proposition 22. For every r > 1, set 



k(t) 



l(dsMdt)- 



r+s+t-1 

The finite measure k(C(T)) ■ Q*{dT dv) is invariant under S. 
Proof. Let Fbea bounded measurable function on T*. We have to prove that 

/ ' FoS(T,v)n(C(T))e*(dTdv)= I ' F{T,v) K {C{T))<d*{dTdv). 



(45) 



Recall that Q*(dTdv) = Q(dT)i / -r(d'v) by construction. If we fix T G T, the distribution of 
the pair (v\ , v) under vj- is given by 



/ v T (dv)l {vi = i} g(v) = r( ^'^' r [ (rr ^ J VT (t) {du)g(u) 



C{T (i) ) 
C{T {1) )+C{T {2) ) 



where i £ {1,2} and g is any bounded measurable function on {1,2} N . It follows that the 
left-hand side of ( 45 1 may be written as 



J2 f F(r {i) ,xi)K(C(T)) 7 ^^ r 9(ri7> T(i) (rfu). 



(46) 



We then observe that under Q(dT) the subtrees 7(i) and 7(2) are independent and distributed 
according to 0, and moreover we have 



cm = ( 



u - 



l-U 



C{T (1) )+C{T m , 



where U is uniformly distributed over [0,1] and independent of (7m, T(2))- Using these 
observations, and a simple symmetry argument, we get that the quantity (46 1 is also equal 
to 



C(T) 



2j^ dxj e(dT)e(dr)v T (du)F(T,u) C(TiC . T 

»i 



-) K (( a 



1 -x 



B'l./Tf/ulFlT.u) (2/ dxjQ{dT') C{T]CrTI] 



C{T)+C{V) 
C{T) // !-.<■ 



7y-(( a 



C(T)+C(T') 



Hence, the proof of (45) reduces to checking that, for every r > 1, 



(47) 



To verify (47), let Cq,Ci,C2 be independent and distributed according to 7, and let U be 



uniformly distributed over [0, 1] and independent of {Cq,C\,C2) under the probability measure 
P. Note that by definition, for every x > 1, 



k(x) = E 



xC\ 
x + d + C 2 - 1 



39 



It follows that the right-hand side of ( 47 1 can be written as 



2E 



^( u+ ^y 



r + C 
-2rE 
-rE 
-rE 

-rE 



- 1 






(Ci + C 2 - l)(U(C + r) + l-U) + C + r 



(d + C 2 - l)(J7(Co + r) + 1 - U) + C + r 

C1+C2 

(Ci +C 2 )(U{C + r - 1) + 1) + (Co + r - 1)(1 - U) 

1 



rE 



(C + r-l)(c/ +5 ^_)+l 

C 

r + Co+C-l 



where C = {U+ r~+r )~ X ■ ^ ©> ^ ^ s distributed according to 7. Since C is also independent 
of Co, we immediately see that the right-hand side of the last display is equal to n(r), which 
completes the proof of (47 1 and of the proposition. □ 



One can verify that the shift 5* is ergodic with respect to the invariant probability measure 
obtained by normalizing k(C(T)) ■ Q*(dTdv) (we omit the proof). One then applies the 
ergodic theorem to the two functionals defined as follows. First we let Z n (T, v) denote the 
height of the n-th branching point on the geodesic ray v. One immediately verifies that, for 
every n > 1, 



If A = / n(C(T))0*(dTdv), it follows that 

Zi(T,v)«(c(7))e*((rrdv). 

Note that the limit can also be written as 



-Z n -^f A- 1 



(48) 



A _1 E 



|log(l-t/)|«((V+^ 



1 -U \-' 
C2 



with the notation of the preceding proof. Secondly, if x„, v stands for the the 71+ 1-st branching 



point on the geodesic ray v (with the notation of Section 2.2 x„ jV = ((vi, . . . , v n ), Z n+ i(T, v)) 
if v = (vi,V2, ■ • •))> we set f° r every n > 1, 

ff„(T,v) = log^ r ({u G {1,2} N : x n , v -< u}). 

It is then also easy to verify that 

n-l 



tf„ = £ H x o S* 



and we have thus 



i=0 



-H n e ^A s - A- 1 I H x {T,v)K{C{T))Q*{dTdv). 

ft n—toc 



(49) 
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The limit can be written as 
2A~ 1 E 



cT^ log (c^) K (( c/+ ^T^) ') 



By combining ( 48 1 and ( 49 ) , we now obtain that the convergence (TTj) holds with limit 



2E 



& 



Ci+C 2 



log 



Ci 



Ci+C 2 



k[[U + 



1-U 

C1+C2 



E 



bg(l-!7)|«((l7+£ 



+c 2 



-1 



We leave it as an exercise for the reader to check that this is consistent with the other 
formulas for j3 in Proposition [3] 

5.2 Supercritical Galton— Watson trees 

One may compare our results about Brownian motion on the Yule tree to the recent paper of 
A'idekon [TJ, which deals with biased random walk on supercritical Galton- Watson trees. To 
this end, consider the supercritical offspring distribution $( n ' given by 0( n >(l) = 1 — - and 
0(")(2) = i. If T (n) is the (infinite) Galton- Watson tree with offspring distribution 9^, 
then T^ n \ viewed as a metric space for the graph distance rescaled by the fact n _1 , converges 
in distribution in an appropriate sense (e.g. for the local Gromov-Hausdorff topology) to 
the Yule tree V. 

Consider then the biased random walk [Zri )k>o on f^ n ) with bias parameter A^ n ^ = 1 — - 
(see e.g. [TJ5] or [JJ for a definition of this process). Since the "mean drift" of Z^ n > away 
from the root is ^ + o(n~ l ), it should be clear that the rescaled process (Z^ 2t i)t>o is 
asymptotically close to Brownian motion with drift 1/2 on the Yule tree, in a sense that can 
easily be made precise. 

An explicit form of an invariant measure for the "environment seen from the particle" 
has been derived by A'idekon [TJ Theorem 4.1] for biased random walk on a supercritical 
Galton- Watson tree (see also [TT] for a related result in a different setting). In the unbiased 
case such an explicit formula already appeared in the work of Lyons, Pemantle and Peres 
[12], but in the subsequent work of the same authors [JJ5] dealing with the biased case, only 
the existence of the invariance measure was derived by general arguments. It is tempting to 
use Aidekon's formula and the connection between the A^-biased random walk on T*-™' and 
Brownian motion with drift \ on the Yule tree to recover our formulas for invariant measures 
in Propositions [TT] and [22] Note however that the continuous analog of Aidekon's formula 
would be an invariant measure for the environment seen from Brownian motion on the Yule 
tree at a fixed time, whereas we have obtained invariant measures for the environment at the 



last visit of a fixed height (Proposition 11) or the last visit of a node of the n-th generation 
(Proposition 22 1. Still the reader should note the similarity between the limiting distribution 
in [TJ Theorem 4.1] and the formula for the invariant measure in Proposition 22 Indeed, we 
were able to guess the formula for k in Proposition 22 from a (non-rigorous) passage to the 
limit from the corresponding formula in [1 . 
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